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PURIFICATION DIRECTED CLONING OF PEPTIDES 
Background of the Invention 

In vitro DNA manipulation and the attendant 
transfer of genetic information have developed into a 
5 technology that allows the efficient expression of 
endogenous and foreign proteins in microbial hosts. 
Although expression of any foreign protein in any 
microbial host is theoretically possible, many problems 
limit its practice. Stability of the protein produced 
10 often limits such practice and results in a low yield. 
In particular, small foreign proteins and oligopeptides 
cannot be overproduced in most cellular hosts. While 
larger proteins can be produced at low or even high 
yield, their purification from the total cell extract 
15 often poses problems. 

To address these concerns, several researchers 
have recently investigated the fusion of a peptide 
marker to the protein being expressed so that the 
protein can be readily recognized during purification. 
20 For example, a method such as the one described in U.S. 
Patent No. 4,782,137, utilizes an antibody immobilized- 
immunoaff inity technique in combination with an 
expressed, fused protein containing a small antigenic 
oligopeptide marker having an "antigenic head" portion 
25 and a "linking tail" portion. The antigenic portion is 
a combination of hydrophilic cunino acids that readily 
elicit antigenic response. The linking portion permits 
cleavage of the marker from the desired protein after 
the fused protein has been isolated from the cells. 
30 Other methods involve expression and 

purification techniques combining the use of enzymatic 
markers such as glutathione transferase, beta- 
galactosidase and chloramphenicol acetyl transferase 
which bind to appropriate substrates. See D.B. Smith, 
35 et al.. Gene, 67, 31-40 (1988); Japanese application 
JF62283998, Derwent Abstract No. 4733853 and EFO 
application 131,363. In these instances, the enzyme is 
fused to the desired protein, the fusion product 
separated by binding to an immobilized substrate and the 
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enzyme cleaved from the desired protein by biochemical 
methods . 

Expression of a small polypeptide in a host 
cell also raises the possibility that the host will 
5 assimilate the polypeptide. In instances where the 
recognition marker and desired protein are small, e.g. 
below about 60 to 80 amino acid units in lengthy 
assimilation rather than e35>ression as an end-product 
usually occurs. 

10 The efficiency of expression in the host cell 

can also vary depending upon the character of the 
protein being produced. The host may produce negligible 
amounts of fused peptide because the variation of the 
fused peptide structure can trigger a decrease in the 

15 transcriptional and/or translational efficiency. In 
psarticular, triggering is a problem with a system that 
uses a small oligopeptide marker* 

Production of fusion proteins containing larger 
peptides exemplified by beta-galactosidase, glutathione 

20 transferase or chloramphenicol acetyl transferase 
generally creates a new set of difficulties. 
Purification usually is not very efficient or effective. 
For such large fusion proteins, the binding constants of 
the contained enzymes to their substrates usually are so 

25 low that purification is difficult to achieve. During 
purification the substrates often cleave from the column 
support and contaminate the separated fusion protein. 
Many of the enzymes are such large molecular weight 
proteins that the desired protein constitutes only a 

30 small fraction of the fusion protein. 

Consequently, production and purification of 
large quantities of protein by a microbial route need to 
be based upon techniques that avoid or alleviate the 
foregoing concerns. Overproduction of the desired 

35 protein relative to other proteins produced by the host 
would also be advantageous. The protein used for 
binding also should be one that has a high affinity for 
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a small molecule, so as to avoid the adverse complexing 
capacity, leakage, viral contamination and other 
problems associated with immunoaf finity chromatography. 

Therefore, it is an object of the invention to 
5 develop an expression system that allows the large 

scale, high yield overproduction of both small and large 
polypeptides. A further object is to provide a facile, 
highly efficient purification scheme that has general 
applicability to any product polypeptide such as 
10 proteins or very small peptides. Yet another object is 
the development of protein expression and separation 
methods that utilize inexpensive materials. A further 
object is practice of the method with soluble affinity 
systems that permit stable ^nd high yield expression of 
15 polypeptides. A particular object is the development of 
a separation system based upon affinity techniques 
employing a recombinant and expressed binding protein 
that has a high affinity for its ligand. 

20 Summary of the Invention 

These and other objects are achieved by the 
present invention which is directed to microbial and 
higher organism methods for expression and ligand 
immobilized affinity purification of a product peptide. 
25 The invention is directed as well to biological 

compositions for expressing and purifying the product 
peptide. In particular the invention includes a protein 
purification construct incorporating the product 
peptide • 

30 Generally, the biological compositions of the 

invention include; 

(a) the expressed protein purification construct 
having three tandem, coupled segments composed of a 
binding protein, an interconnecting peptide and a 

35 variable fused polypeptide; 

(b) a recombinant gene for the protein purification 
construct; 
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(c) a recombinant expression vector incorporating 
that recombinant gene; 

(d) transformed cells or higher organisms carrying 
the vector; 

5 (e) a gene for the binding protein fused to the 

interconnecting peptide; and 

(f) the binding protein fused to the interconnecting 
peptide. 

The protein purification construct of the 

10 present invention has a three tandem segment, single 
chain formula. The first segment is a binding protein 
which exhibits strong, reversible binding to a specific 
small molecular weight ligand. The second segment is an 
interconnecting peptide which is selectively cleavable 

15 by an enzyme or chemical technique. It is typically a 
short chain peptide. The third segment, the variable 
fused polypeptide, incorporates any natural or synthetic 
polypeptide desired as a product, e.g., any desired 
protein, oligopeptide or small molecular weight peptide. 

20 In general, the variable fused polypeptide is composed 
of a single product peptide, or multiple units of 
product peptides. The product peptides are the ultimate 
products of desirable character to the consumer. The 
variable fused polypeptide can incorporate naturally 

25 assimilable amino acids as well as optional lipid, 
glycan and other ancillary moieties such as vitamin 
derived cof actors and/or metal ions. 

The variable fused polypeptide has several 
forms. The first is a single product peptide. The 

30 second is composed of multiple tandem units of a single 
product peptide that are linked by an intraconnecting 
peptide. In this case, the intraconnecting peptide 
usually but not necessarily differs in structure and 
selectivity from the interconnecting peptide. When 

35 different, the intraconnecting peptide nevertheless has 
the same general function as the interconnecting peptide 
so that two different cleavages of an enzymatic or 
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chemical nature will sequentially cleave the variable 
fused polypeptide from the binding protein and then the 
Individual product peptides from each other. The third 
form Is a single unit composed of several (I.e., two or 
5 more) Identical or different product peptides tandemly 
interlinked together by innerconnecting peptides. The 
fourth form is composed of repeating multiple tandem 
units linked together by intraconnecting peptides 
wherein each unit contains the same series of different 
10 individual product peptides joined together by 

innerconnecting peptides. The fifth form is composed of 
a series of tandem units linked together by 
Intraconnecting peptides wherein each unit contains 
several identical or different product peptides joined 
15 by Innerconnecting peptides and the product peptides do 
not repeat from unit to unit. The sixth form is 
composed of identical multiple tandem units wherein each 
unit contains several identical product peptides joined 
by innerconnecting peptides. 
20 Preferred embodiments of the product peptide 

which can appear as single or multiple linked units in 
the variable fused polypeptide include caltrin, 
calcitonin, insulin, tissue plasminogen activator, 
growth hormone, growth factors, growth hormone releasing 
25 factors, erythropoietin, interferons, inter leukins , 
oxytocin, vasopressin, ACTH, collagen binding protein, 
assembled antibodies, individual heavy and light 
antibody chains, antibody fragments such as P.^, P(ab)2f 
Pc, individual chain fragments especially such as the 
30 isolated variable regions (VH or VL) as characterized by 
Lerner, Science ^ 246 , 1275 et. seq. (Dec. 1989) and 
epitopal regions such as those characterized by E. Ward 
et al.. Nature , 341 , 544-546 (1986) wherein the 
antibodies, chains, fragments and regions have natural 
35 or immunogenetically developed antigenicity toward 
antigenic substances. Additional embodiments of the 
desired polypeptide include polypeptides having 
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physiologic properties, such as sweetening peptides, 
mood altering polypeptides, nerve growth factors, 
regulatory proteins, functional hormones, enzymes, DNA 
polymerases, DNA modification enzymes, structural 
5 polypeptides, synthetic polypeptides, neuropeptides, 
polypeptides exhibiting effects upon the cardiovascular, 
respiratory, excretory, lymphatic, immune, blood, 
reproductive, cell stimulatory and physiologic 
functional systems, leukemia inhibitor factors, 

10 antibiotic and bacteriostatic peptides (such as 

cecropins, attacins, apidaecins), insecticide, herbicide 
and fungicide peptides as well as lysozymes. 

The binding protein segment of the protein 
purification construct generally is an enzyme-like 

15 protein including but not limited to an enzyme or a 
truncated, altered or modified functional version 
thereof (hereinafter the modified functional version) • 
The binding protein exhibits highly selective affinity 
binding to a low molecular weight ligand or a synthetic 

20 derivative thereof. The binding is strong so that, in 
general, the conjugation of the binding protein and 
ligand will exhibit a solution dissociation constant 
(inverse of the binding constant) of no more than about 
lO'^M. Generally, the ligand is a reversible inhibitor 

25 for the enzyme-like protein. Preferred embodiments of 
the binding protein include carbonic anhydrase derived 
from any source, especially mammalian or human, and a 
modified functional version thereof which will bind with 
the inhibitor, sulfanilamide or derivatives thereof. An 

30 especially preferred embodiment of the modified carbonic 
anhydrase enzyme is a functional version which (I) does 
not contain methionine, (II) has all or some glutamates 
replaced by another amino acid, preferably aspartate, 
(III) has all or some arginines replaced by another 

35 amino acid, preferably lysine, (IV) has all or some 

asparagines replaced by another amino acid, preferably 
glutamine, (V) has methionine replaced by another amino. 
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acid, preferably alanine or lysine, or (VI) has cysteine 
replaced by another amino acid, preferably serine. 

Antibodies or individual chains, regions or 
fragments thereof, as characterized above, and other 
5 proteins, which will strongly, biospecifically and 
reversibly bind to a low molecular weight ligand, can 
perform the same function in the same way to reach the 
same result as the enzyme-like protein in the context of 
the protein purification construct, and consequently are 
10 included within the invention as binding proteins. For 
antibodies or the corresponding chains, regions or 
fragments, the ligand is a low molecular weight antigen, 
preferably an aromatic moiety such as dinitrophenol . 

The choice of the 4.nterconnecting peptide for 
15 the protein purification construct depends upon the 

choice of cleavage enzyme and product peptide sequence. 
In general, this interconnecting peptide sequence 
constitutes any peptide sequence that uniquely reacts 
with a highly specific cleavage enzyme or by a highly 
20 specific chemical reagent cleavage. 

The recombinant gene coding for the protein 
purification construct incorporates three DNA segments 
coding for the peptide sequences of the three segments 
of the construct. The segments are arranged so that 
25 either the binding protein gene fragment or the variable 
fused polypeptide gene fragment can be read first. It 
is preferred to construct the protein purification 
construct gene so that the binding protein gene fragment 
is read first. The gene segments may be synthetic or 
30 derived from natural sources. 

The DNA sequence or sequences for the product 
peptide or peptides incorporates cDNA or genomic clones 
isolated from their native sources as well as synthetic 
DNA sequences. 
35 The expression vector containing the 

recombinant gene is capable of directing expression of 
the protein purification construct in prokaryotic or 
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eukaryotic cells. The expression vector incorporates 
the protein purification construct gene and base vector " 
segments such as the appropriate regulatory DNA 
sequences for transcription, translation, phenotyping, 
5 temporal or other control of expression, RNA binding and ; 
post-expression manipulation of the expressed product. 
The expression vector generally will include structxiral 
features such as a promoter, an operator, a regulatory 
sequence and a transcription termination signal. The 

10 expression vector can be synthesized from any base 

vector that is compatible with the host cell or higher 
organism and will provide the foregoing features. The 
regulatory sequences of the expression vector will be 
specifically compatible or^ adapted in some fashion to be 

15 compatible with prokaryotic or eukaryotic host cells or 
higher organisms. Post-expression regulatory sequences, 
which cause secretion of the protein purification 
construct can be included in the eukaryotic expression 
vector. It is especially preferred that the expression 

20 vector exhibit a stimulatory effect upon the host cell 
or higher organism such that the protein purification 
construct is overproduced relative to the usual 
biosynthetic expression of the host. 

Transformed prokaryotic or eukaryotic cells or 

25 higher organisms carrying the appropriate recombinant 
prokaryotic or eukaryotic vectors constitute the 
transformed microbes or higher organisms of this 
invention. The prokeurotic cells useful as hosts 
include any that are amenable to expression of foreign 

30 protein. Preferred embodiments include Ej_ coli and 
subtilis. The eukaryotic cells include unicellular 
organisms, such as yeast cells, as well as immortal 
cells from higher organisms, such as plant, insect or 
mammalian cells. Preferred eukaryotic cells include 

35 cerevisiae, Soodoptera frupiperda ^ and corn, tobacco or 
soybean plant cells. The higher organisms useful as 
hosts include higher order plants and animals having 
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germ cells that are amenable to transformation. 
Included are plants such as tobacco, corn, soybean and 
fruit bearing plants, and invertebrate and vertebrate 
animals such as fish, birds and mammals especially 
5 including sheepi goats, cows, horses and pigs. 

The invention as well includes a cultured, 
transformed cell colony or transformed plants or animals 
that are capable of expressing the protein purification 
construct . 

10 Generally, the method of the invention calls 

for: 

(a) host cell or host organism expression regulated 
by the recombinant expression vector carrying the 
recombinant gene coding for ^ the protein purification 

15 cons tame t; 

(b) purification of the protein purification 
construct by a ligand immobilized affinity separation 
technique; and 

(c) enzymatic or chemical cleavage of the protein 
20 purification construct in one or more stages to produce 

the product peptide. 

The expression steps of the method according to 
the present invention are based upon microbial or higher 
organism protein expression. The steps call for 

25 inserting the recombinant gene into an appropriate base 
vector, transforming host cells or higher organisms with 
the resulting recombinant vector and expressing the 
protein purification construct preferably as a soluble 
product within the host cell or higher organism, as a 

30 product that is insoluble in the cell cytoplasm, or as a 
secreted product by the host cell or higher organism. 
When higher organisms are chosen as the host, fertilized 
germ cells of that organism are transformed and the 
transformed organism grown through usual maturation 

35 techniques. 

The purification steps of the method call for 
affinity binding of the protein purification construct 
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to immobilized ligand, and separating it from other 
cellular constituents^ debris and culture medium. The " 
variable fused polypeptide is obtained from the 
immobilized protein purification construct through 
5 enzymatic or chemical cleavage action upon the 

interconnecting peptide , and separating the variable 
fused polypeptide from the cleavage enzyme or other 
material. (Throughout this application, mention of 
enzymatic or chemical cleavage alone will be understood 

10 to include both.) 

Alternatively, the purification steps can 
separate the entire protein purification construct from 
the immobilized ligand after purification and cleave it 
with an immobilized cleavage enzyme or chemical reagent 

15 to produce a mixture containing the variable fused 
polypeptide and binding protein. This mixture can be 
separated by use of an immobilized ligand for the 
binding protein and removal of the purified, variable 
fused polypeptide. 

20 When the variable fused polypeptide is a single 

product peptide, the foregoing separation completes the 
synthesis. When the variable fused polypeptide contains 
multiple units or a single unit of several identical of 
different product peptides as explained above, 

25 subsequent, sequential enzymatic or chemical cleavage of 
the intraconnecting peptides, and of the innerconnecting 
peptides, produces the product peptides in separated 
form . 

Generally, the three linking peptides - 
30 interconnecting peptide^ intraconnecting peptide and 
innerconnecting peptide - are chosen to be cleavable 
with different enzymes or chemicals- The combination of 
amino acid sequences in the binding protein and in these 
linking peptides are also chosen so that they do not 
35 duplicate amino acid sequences in the product peptides. 
The modifications of the binding protein mentioned above 
facilitate this choice. 
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Preferred embodiments of the method include . 
those expressing the recombinant gene composed of DNA 
segments for human carbonic anhydrase (or a modified 
functional version thereof), interconnecting peptide and 
5 a single product peptide or multiple units thereof. 

Additional preferred embodiments include use of coli 
or yeast as the host cells and use of controlled 
expression by means of any induction system such as 
temperature, nutrients, isopropyl thiogalactoside, 

10 indole acrylic acid, carbon sources and the like, so as 
to allow the production of a protein purification 
construct that would be toxic to the host. Further 
preferred embodiments include use of an expression 
vector system for prokaryotic cells which incorporates a 

15 two plasmid construction, and an expression vector 
system for yeast cells which incorporates a shuttle 
vector with an origin of replication for coli and one 
for Sj^ cerevisiae. 

An especially preferred method involves 

20 expression of multiple tandem units of a single product 
peptide from a single recombinant gene carrying 
multiples of the DNA sequence for the product peptide. 

Brief Description of the Drawings 

25 Figure 1 shows the various formulas for the 

variable fused polypeptides formed of multiple units of 

product peptides. 

Figure 2 shows several specific examples based 

upon some of the formulas of Figure 1. 
30 Figure 3 shows the amino acid sequence for a 

preferred binding protein, human carbonic anhydrase. 

Detailed Description of the Invention 

The expression of a foreign protein by 
35 interaction of recombinant vectors with the biosynthetic 
machinery of host cells or host organisms is a well- 
known technique for biochemical protein synthesis. The 
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present invention utilizfes a novel modification of this 
expression technique in combination with a ligand 
immobilized affinity separation technique to establish a 
new and unparalleled method for large scale, low cost, 
5 highly efficient biological synthesis. It eliminates 
expensive machinery and reagents, long synthetic times, 
low reaction efficiency and produces fewer faulty copies 
and a higher yield of the product peptide, than the 
solid phase peptide synthesis « 

10 The method of the invention incorporates a 

combination of cellular factors and biologic 
compositions that enable ready expression of foreign 
proteins by biological systems. Because the expressed 
protein purification construct incorporates a large 

15 binding protein, subversion of the particular identity 
of the product peptide by the expression mechanism of 
the cell or organism is not permitted. Variation in 
expression efficiency is minimized. The preferred 
inductive expression mechanism allows production of 

20 peptide purification constructs that would otherwise be 
toxic to host cells. 

The method of the invention also incorporates 
factors that contribute substantially to the efficiency, 
capacity and yield of the purification technique. The 

25 high binding constant (low dissociation constant) of the 
binding protein for its immobilized ligand enables 
clean, complete separation of the variable fused 
polypeptide from other constituents. The low molecular 
weight of the immobilized ligand achieves a high 

30 capacity and large scale of separation per unit weight 
of iiranobilization support. When the binding protein and 
ligand are an enzyme and inhibitor, there is no spurious 
enzymatic cleavage action to produce undesirable side 
products . 

35 Additional features of the invention that are 

advantageous include the use of the multiple unit 
concept and codon diversity in the gene segment for the 
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variable fused polypeptide. These features are based 
upon the discovery that repetitious gene sequences can 
be unstable in the host cell. 

5 lA. Method for Expression - Host Cells 

The microbial method of the invention employs 
host cells transformed with an expression vector 
carrying a recombinant protein purification construct 
gene which will cause the host cells to express the 
10 protein purification construct. The vector carrying the 
protein purification construct gene is prepared by 
insertion of the DNA segments coding for the protein 
purification construct into an appropriate base vector. 

In one scheme for construction of the vector, 

15 the DNA segment for the binding protein, for example the 
human gene for carbonic anhydrase II, (the binding 
protein gene) is inserted into a base plasmid which is 
compatible with the host cell to be transformed. The 
base plasmid contains the necessary regulatory sequences 

20 for high level expression of genes placed downstream. 

A synthetic DNA sequence coding for the 
interconnecting peptide is then inserted near the 3' end 
of the binding protein gene. A restriction enzyme site 
near the 3' end of the binding protein gene should be 

25 present to enable insertion of this DNA sequence for the 
inter-connecting peptide. Also, at least one convenient 
restriction enzyme site (intermediate vector restriction 
site) should be designed into the synthetic DNA sequence 
for the interconnecting peptide so that DNA segments 

30 coding for the variable fused polypeptide can later be 
inserted in the correct reading frame. If no such sites 
are already present, they can be introduced at this 
point in the scheme by a site-specific mutagenesis after 
standard procedures described in Sambrook, J., Fritsch, 

35 E.F. and Maniatis, T., Molecular Cloning, A Laboratory 
Manual f Cold Spring Harbor Laboratory, Cold Spring 
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Harbor, N.Y. (1989), the disclosure of which is 
incorporated herein by reference. 

The resulting vector construct is the 
intermediate base vector for the in site construction of 
5 the protein purification construct gene integrated into 
the larger vector. Any naturally occurring or synthetic 
DNA sequence can be inserted into the intermediate 
vector restriction site to yield a protein purification 
construct gene integrated into the expression vector - 

10 Proper insertion and reading frame alignment can be 
verified by known techniques such as sequencing the 
junction region between the binding protein gene and the 
DNA sequence for the variable fused polypeptide 
according to methods described in Sambrook et al. 

15 Alternate synthetic routes also will provide 

the same construction of the final expression vector. 
In one alternative route , any two adjacent DNA segments 
of the pixjtein purification construct gene can be 
ligated together to form an intermediate gene and then 

20 combined with the third segment to form the protein 
purification construct gene in non-integrated form 
(i.e., separate entity). The non- integrated protein 
purification construct gene can then be inserted into an 
appropriate base vector following the restriction and 

25 ligation methods described above. Appropriate unique 
restriction sites can be inserted as described above 
(Sambrook protocols) if not already present in the base 
vector chosen. 

In a second alternative, after ligating 

30 together any two adjacent DNA segments, the resulting 
intermediate gene can be transferred to the base vector 
by the restriction and ligation methods described above. 
The third DNA segment (i.e., the binding protein gene or 
variable fused polypeptide gene) can be inserted into 

35 the base vector carrying the intermediate gene pursuant 
to the Sambrook techniques including construction of 
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procedures described above. 

This same methodology for final expression 
vector construction can be employed in the preparation 
5 of vectors carrying protein purification construct genes 
reading for protein purification constructs of a reverse 
sequence, i.e., reading as variable fused peptide- 
interconnecting peptide-binding protein. In this 
instance, the gene for the interconnecting peptide 
10 and/or other appropriate genes are inserted at the 5 ' 

end of the binding protein gene and/or other appropriate 
gene. All protocols for restriction, insertion, 
ligation and the like follow standard procedures such as 
those described by Sambrooki, cited supra . 
15 The final recombinant expression vector will 

carry an appropriate promoter, a sequence coding for a 
ribosome binding site, phenotype genes for selection, 
and regulatory regions for transcription, translation 
and for post-translational intracellular manipulation of 
20 the resulting protein purification construct. 

Prokaryotic and eukaryotic vectors which have 
been constructed according to the foregoing general 
schemes can function or can be modified to function 
within the biosynthetic machinery of appropriate host 
25 cells. For eukaryotic cells, the vectors preferably can 
carry signal sequences and/or other regulatory sequences 
to allow for excellular secretion of the protein 
purification construct as it is expressed or allow for 
post-translational modifications . 
30 Preferred base vectors for the method of 

invention include any plasmid that is compatible with 
the particular host, is stable in that host and allows 
for a positive selection of the transformed host. Such 
vectors include, for example, pT21B/l9U/R or pPL-lambda 
35 as well as those characterized in P.H. Pouwels, B.E. 

Enger-Valk, and W.J. Branimer, Cloning Vectors , Elsevier 
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Science Pub. (1985) the disclosure of which is 
incorporated herein by reference.. 

The preferred intermediate vector restriction 
sites will depend on the DMA sequence of the 
5 interconnecting peptide^ which in turn is dependent on . 
the type of cleavage chosen. Any restriction enzyme 
site can be used which is compatible with the DMA 
sequence for the interconnecting peptide, or which can 
be manipulated by treatment with DNA modifying enzymes . 

10 (such as T4 DNA polymerase, SI nuclease, Klenow 

fragment, mung bean nuclease or different exonucleases ) 
or by the addition of linkers or adapters to ensure a 
correct sequence and reading frame alignment in the 
junction region. 

15 Isolation of host cells transformed with the 

final recombinant expression vector is accomplished by 
selecting for the phenotype or other characteristic that 
is designed into the recombinant vector. Generally, 
such selection characteristics include antibiotic 

20 resistance or complementation of deficient functions in 
the host. Preferred phenotype genes for the recombinant 
vector of the invention include antibiotic resistent 
phenotypes, essential amino acid phenotypes and other 
essential compound phenotypes. 

25 Culturing the selected, transformed host cells 

in a medium containing nutrients, growth factors and 
vitamins will yield the protein purification construct 
in those cases where a non- inducible expression system 
is used. Preferably, an inducible expression system is 

30 used so that the selected, transformed host cell will be 
grown to eeirly- to mid- logarithmic phase and treated 
with an induction compound to cause the protein 
purification construct to be produced. Typically, 
incubation will be continued for up to several hours 

35 (the most appropriate incubation time for each protein 
purification construct is determined by sampling at 
differing times during a test incubation) , at which 
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point the cells are harvested and lysed to release the 
cellular contents containing the protein purification 
construct. If the transformed host cell is designed to 
secrete the protein purification construct, the culture 
5 is grown until an appropriate and/or desired 

concentration of the protein purification construct is 
present in the culture medium. If the host cell is one 
that will contain dissolved protein purification 
construct in its cytoplasm, the culture is grown until 

10 it reaches optimum maturity. The mature culture is then 
lysed with an appropriate agent to release the cytoplasm 
containing the protein purification construct. If the 
protein purification construct is deposited as insoluble 
granules in the host cell, 4:he mature cell culture, 

15 lysed and the released insoliible granules are dissolved 
in chaotropic agents and refolded in vitro > This 
incubation, growth and lysing process can be conducted 
in a batch or continuous manner. 

In either alternative for producing protein 

20 purification construct, the medium containing the 

protein purification construct obtained from the cells 
is taken as a crude mixture of materials into the method 
for purification. 

25 IB, Method for Expression - Higher Organisms 

The multicellular method of the invention 
employs higher organisms transgenically altered with a 
genetic expression cassette carrying a recombinant 
protein purification construct gene which will cause the 

30 higher organism to express the protein purification 
construct. The cassette carrying the protein 
purification construct gene is prepared by insertion of 
the DNA segments coding for the protein purification 
construct into an appropriate base cassette. The base 

35 cassette will be compatible with host organism germ 

cells and will be designed to combine appropriately with . 
the genetic material of those cells. It will carry or 
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will be modified to carry the appropriate promoter,* 
operator, regulatory and phenotype regions needed to 
achieve transcription, translation/ and post-translation- 
intracellular manipulation of the resulting protein 
5 purification construct. 

Higher organisms, including pllants such as 
com, soybean and tobacco, and animals such as fish or 
mammals (e.g., goats, cows, horses, rabbits and mice) 
can function as the hosts for this method. After sexual 

10 fusion of host haploid germ cells and transformation of 
those fertilized germ cells with the recombinant 
expression cassette, the transgenic animal can be grown 
by reimplantation of the transgenic germ cell into the 
uterus and growth of the transgenic embryo- Plants also 

15 can function as the host of this method. Transgenic 
plants can be obtained by Agrobacterium mediated 
transformation, electroporation of plant protoplasts or 
microprojectile bombardment. Upon maturation of the 
transgenic plant or animal, the expressed protein 

20 purification construct can be found in the plant 
material, e.g., leaves, seeds, fruit, or in animal 
tissues, e.g., meat, organs, or in tissue secretions, 
e.g., sap, urine or milk. Homogenization and/or 
extractive concentration methods applied to the 

25 material, tissues or secretions can yield a crude 

mixture of materials including the protein purification 
construct for use in the method for purification. 
Generally, these methods follow the techniques and 
protocols described in the references in Table 1, the 

30 disclosures of which are incorporated herein by 
reference. 
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Table 1 

Protocols for preparation of 
transgenic sheep and mice 

1. Gordon K, , Lee L., Vitale J. A., Smith A.E., Westphal 
5 H., and Hennighausen L. (1987) Bio /Technology . 5, 

1183-1187. 

2. Pittius C.W., Hennighausen L., Lee L., Westphal H., 
Nichols E., Vitale J. A., and Gordon K. (1988) Proc. 
Natl. Acad. Sci. USA . 85, 5874-5878. 

10 3. Clark A.J., Bassos H., Bishop J.D., Brown P., Harris 
S., Lathe R., McLenaghan M., Prouse C, and Simons 
J. P. (1989) Bio /Technology . 1, 487-492, 

Protocols for preparation of transgenic rabbits 
15 1. Buhler Th.A., Bruyere Th., Went D.F., Stranzinger 

G., and Burki K. (1990) Bio /Technology , 8, 140-143. 

Protocols for preparation of transgenic plants 

1. Hooykaas P.J.J. , and Schilperoort R.A. (1987) 
20 Methods in Enzymology . 153 , 305-313. 

2. Shillito R.D., and Potrykus I., (1987) Methods in 
Enzymology , 153 , 313-336. 

3. Weis singer A., Thomes D., Maddock S., Fromm M., and 
Sanford J. (1988) in Current Communications in 

25 Molecular Biology , (Fraley R.T., Frey N.M., and 

Schell J., eds.) Cold Spring Harbor, NY. 

4. Lichtenstein C, and Draper J. (1986) in DNA 
Cloning , vol. II, (Glover D.M. ed.), IRL Press Ltd., 
Oxford UK. 



30 
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II. Method for Purification 

To perform piirif ication, the crude mixture of 
materials is combined with an immobilized ligand for the 
binding protein. Examples of the binding protein, 
5 corresponding ligand and associated dissociation 
constants are given in following Table 2. For the 
preferred carbonic anhydrase enzyme, the ligand is 
sulfanilamide or a benzene sulfonamide derivative with a 
dissociation constant of no greater than 10'\ For 

10 preferred avidin/streptavidin the ligand is the biotin 
analog 2 -amino biotin with a dissociation constant of on 
the order of biotins but which dissociate when the pH is 
raised and the ligand associates with a proton. The 
protein purification construct binds to the immobilized 

15 ligand through the reversible affinity of the binding 
protein for its ligand. The remaining constituents and 
debris of the crude mixture of materials can then be 
removed by washing or similar techniques. 
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Table 2 



Binding Protein 
Xanthine Oxidase 
5 Adenosine deaminase 
Adenosine deaminase 
Adenosine deaminase 

Dihydrofolate reductase 

10 Dihydrofolate reductase 

Dihydrofolate reductase 

Dihydrofolate reductase 

Ribulose bisphosphate 
carboxylase 

15 Pepsin 

Calmodulin 

Calmodulin 

Cholesterol esterase 

Carbonic anhydrase II 
20 Carbonic anhydrase II 

Hemoglobin 
ATPase 

ATPase 



Ligand Kd 

Allopurinol strong 

Coformycin <1.2£-10 

Deoxycof ormycin 2 • SE.12 

erythro-9- {2-hydroxy-3 1 . 6E-9 
nonyl) adenine 

Methotrexate 1 . 2£-9 

Methotrexate 2 . 3E-9 

Aminopterin 3.7E-9 

Trimethoprin 4 . 6E-9 

2 carboxyarabirital lE-14 
1,5 bisphosphate 

Pepstatiii lOE-9 

Melittin 3E-9 

Various peptides 0.2E-9 

Borinic acid O.lE-9 

Sulfanilamide 4 . 6E-7 

Acetazolamide 6 E-10 

Cyanide 

AMPPNP 

Vanadate 



ref. 
1 
1 
2 
2 

4 
5 
5 
5 
6 



7 
7 
8 
3 
3 



25 



£ is times ten to the negative exponent indicated. 



References Cited in Table 2 



30 



35 



1. 
2. 
3. 
4. 
5. 
6. 



Cha et al., Biochemical Pharm. . 24, 2187-2197. 
Agarval et al., Biochemical Pharm. . 26 . 354-367 (1977). 
Taylor, P.V. et al., Biochemistry . 9, 2638 (1970). 
Cha et al,, Biochemical Pharm. . 30, 1507-1515 (1981). 
Williams et al., Biochemical Pharm. . 29, 589-595 (1980), 



Pierce, J., Tolbert, N.E 
-942 (1980). 

Degrado et al., J. of Cellular Biochem. . 
(1989). 

Sutton et al., BBRC, 134, 386-392 (1986) 



Barker, R. , Biochem. . 19 . 934 
29, 83-93 
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Immobilization of the ligand on a solid support 
can be accomplished by the methods- of W. Scouter, 
Methods Enzvmol, . 34/ 288-294 (1974); S. Marcus, Methods 
5 Enzvmol. . 34/ 377-385 (1974); A. Matsura et al*. Methods 
Enzvmol. t 34/ 303-4 (1974); R. Barker, Methods Enzymol., 
34^ 317-328 (1974); I. Matsumoto, Methods E nzvmol > , 34/ 
324-341 (1974), J. Johansen, Carlsbera Res . Commun . , 14, 
73 (1976) and G. S. Bethell et al., J. Biol. Chem^ 2S4, 

10 2572-2574 (1979); the disclosures of which are 

incorporated herein by reference. Preferred solid 
supports include cellulose derivatives, polyamides, 
dextrin, gels, polystyrene, polyolefins, silicas and 
glasses. Alternatively/ the ligand may be coupled to an 

15 iron containing molecule such as a ferrocenyl derivative 
and a magnet used to perform the immobilization and 
separation of the iron bound protein purification 
construct. Liquid - liquid affinity purification can 
also be used. In this instance the ligand is bound to a 

20 substance that conveys selective solubility of the 

protein purification construct in a particular solvent 
system relative to the other materials present in the 
crude mixture. 

Two routes can be employed for further 

25 purification of the protein purification construct. 

According to the first route, the protein purification 
construct is dissociated intact from the immobilized 
ligand by washing with a strong competing ligand 
solution. Examples include cyanides, pseudocyanides 

30 such as thiocyanides , perchlorates , halide and similar 
strong Lewis bases. According to the second route, the 
immobilized protein purification construct is contacted 
directly with cleavage reagent to release the variable 
fused polypeptide. 

35 To isolate the variable fused polypeptide in 

the first route, the purified protein purification 
construct is contacted with cleavage reagent 
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(immobilized or soluble) to cleave the protein 
purification construct into binding protein- 
interconnecting peptide moiety and the variable fused 
polypeptide. Contact of this mixture with the 
5 immobilized ligand for the binding protein will bind the 
binding protein-interconnecting peptide moiety and allow 
the removal of the cleaved variable fused polypeptide. 
If the cleavage reagent is soluble, it can be separated 
from the cleaved variable fused polypeptide by ordinary 

10 known techniques such as partition chromatography, 
dialysis, filtration based upon molecular size, high 
pressure liquid chromatography ion exchange 
chromatography and the like. 

To isolate the vai^iable fused polypeptide in 

15 the second route, its mixture with cleavage enzyme may 
be combined with a means for molecular weight selection 
(e.g. partition chromatography dialysis, filtration 
based on molecular size or high pressure liquid 
chromatography on a "particle exclusion" base or ion 

20 exchange chromatography) such that the high molecular 
weight cleavage enzyme is separated fr-'^m the free 
variable fused peptide. Or, the mixture can be combined 
with an immobilized affinity material for the cleavage 
enzyme . 

25 Depending upon the expression gene chosen, the 

free variable fused polypeptide released from the 
interconnecting peptide cleavage will constitute: 

(a) the product peptide in isolated form, or 

(b) multiple units of product peptide(s) that need 
30 further manipulation. 

Several forms of the multiple units are 
possible as mentioned above. In a first form, the 
multiple units may constitute a series of identical 
product peptides linked together by intraconnecting 
35 peptide fragments that differ in structure and 

selectivity from the interconnecting peptide already 
cleaved. In this form, enzymatic cleavage of the 
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intraconnecting peptide yields free multiple copies of 
the product peptide. Especially when the product " . . 

peptide is composed of one hundred or less amino acids 
this multiple unit method is an efficient way to convert 
5 a major portion of the weight of the cellular protein 
products into the product peptide 

In another form, the multiple units may be the 
same repeating unit of a series of different product 
peptides or non-repeating units composed of several 

10 identical or different product peptides. The units are 
linked together by the above-mentioned intraconnecting 
peptide while the different product peptides within the 
unit are linked together by innerconnecting peptides. 
The different product peptides are selected so that 

15 their physical or chemical properties permit their 
separation by conventional or affinity means. To 
facilitate individual product peptide separation, the 
intraconnecting and innerconnecting peptides serve not 
only as sites for peptide cleavage but also the 

20 resulting cleavage residues contribute to the chemical 
or physical differentiation which permits separation of 
individual product peptides. The ionic, lipophobic, 
acidic, basic or antigenic function of the 
intraconnecting and innerconnecting peptide residue 

25 alone or in combination with the attached product 
peptide permits separation of individual product 
peptides after cleavage by means such as affinity 
chromatography, ion exchange chromatography, reverse or 
normal phase chromatography, coiinter current 

30 distribution, acid or base extraction and other known 
methods for peptide separation. In operation, the 
intraconnecting and innerconnecting peptides are 
designed so that they may be effectively and selectively 
cleaved by enzymatic means and the resulting cleavage 

35 residue can be effectively and efficiently removed from 
the product peptide by chemical or enzymatic techniques. 
Typically, the intraconnecting and innerconnecting 
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peptide cleavage residues can be removed from the 
product peptides by appropriate enzymatic digestion with 
a carboxypeptldase or aminopeptidase . 

Generally, the interconnecting peptide, 
Intraconnecting peptide and Innerconnecting peptide 
fragments will have different amino acid sequences so 
that they can be sequentially rather than simultaneously 
cleaved. The amino acid sequences are chosen also so 
that the cleavage sequence does not duplicate any amino 
acid sequence of the product peptide(s). These three 
peptide connecting fragments can be chosen from the same 
group of amino acid unit sequences for example, those 
listed in following Table 3. The factors to consider in 
choosing these three peptidq connecting fragments 
include the following: 

a) The amino acid sequence of the product peptides. 
The connecting fragments cannot contain amino acid 
sequences that are in the product peptides . 

b) Fragments with high cleavage specificity should be 
cleaved first while those with lower specificity 
should be cleaved later* This factor means that 
cleavage of the interconnecting peptide should be 
practiced according to a plan that renders this 
cleavage highly selective relative to the amino acid 
sequence of the variable fused polypeptide. Medium 
and low specificity can be used with the intra and 
innerconnecting peptide respectively. For example, 
methionine cleavage, which has low specificity, is 
most appropriately to use with the innerconnecting 
peptides . 

c) The binding protein cannot contain readily 
accessible amino acid sequences that also constitute 
the interconnecting peptide sequence. The binding 
protein amino acid sequence can be modified as 
described above to eliminate such overlap and 
increase design flexibility. 
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d) The intra and innerconnecting peptides and the gene 
fragments coding for them are positioned and altered" 
to provide for diversity in the gene sequence for 
the variable fused peptide. This diversity allows 
5 efficient eiqpression of multiple units of a small 

peptide. It has been discovered that a continuously = 
repetitive genetic sequence will often be rearranged 
or deleted by the host organism prior to 
recombination . 
j^O The several versions of the protein 

pxirification construct based upon the variable fused 
polypeptide are depicted in Figure 1. Formula I 
illustrates a straightforward version of the protein 
purification construct as a three segment combination of 
15 the binding protein (BP), interconnecting peptide 
(INTERP) and product peptide (PP) . Formula II 
illustrates a multiple unit (U) version of the protein 
purification construct in which the variable fused 
polypeptide is composed of multiple tandem units of 
20 identical PP linked together by intraconnecting peptides 
(INTRAP). Formula III illustrates multiple units linked 
together by INTRAP' s, the units containing two different- 
pp's interlinked by an innerconnecting peptide (INERP). 
Formula IV illustrates multiple units in which each unit 
25 is a series of three PP's interlinked by INERP' s. 

Formula V illustrates multiple units wherein each unit 
in the sequence contains a different series of three 
identical PP's. Formula VI illustrates multiple units 
wherein each unit contains the same series of PP. 
30 For illustrative piirposes, examples of Formulas 

I, II, III and IV of Figure 1 using specific product 
peptides are shown in Figure 2. 

Figure 2A is based upon Formula II of Figure 1 
and shows the structure for a protein purification 
35 construct formed from modified human carbonic anhydrase 
(methionine elimination) as the binding protein and 
multiple units of Fibrino Peptide B (human) as the 
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variable fused polypeptide. The multiple units are 
linked by methionine. The variable fused polypeptide Is 
linked to the binding protein by a tetraaspartyllyslne 
Interconnecting peptide. Selective cleavage of the 
5 connecting peptides with enteroklnase (cleaves INTERP) 
and then with trypsin (cleaves at the carboxy side of 
arglnlne) produces N-Methlonlne Fibrlno Peptide B. 

Figure 2B is based upon Formula III of Figure 1 
and shows the structure for a protein purification 
10 construct formed from modified human carbonic anhydrase 
(methionine elimination) as the binding protein and 
multiple units of Fibrlno Protein B (human, PP^) and 
llpotropln (PP2) joined by a dlalanylarginine 
innerconnecting peptide. Tjie multiple units are linked 
15 by a lyslnylalanlnylmethionine Intraconnectlng peptide. 
The Interconnecting peptide is tetraaspartyllyslne. As 
shown sequential cleavage with enteroklnase (INTERP 
cleavage), then cyanogen bromide (INTRAP cleavage), and 
then trypsin (INERP cleavage) followed by ion exchange 
20 chromatography and carboxypeptidase digestion of the 
connecting peptide residues produces the free Fibrlno 
Peptide B and llpotropln. 

Figure 2C is based upon Formula V of Figure 1 
and shows the structure for a protein purification 
25 construct formed from modified human carbonic anhydrase 
(methionine elimination) as the binding protein and two 
repeating multiple units, the first being repetitious of 
Fibrlno Protein B (human) innerconnected by dlleuclne 
and the second being repetitious of lipotropic 
30 interconnected by arginine. The multiple units are 
linked by alaninylmethionine intraconnectlng peptide ► 
The interconnecting peptide is tetraaspartyllyslne. As 
shown, sequential cleavage with enteroklnase* (INTERP 
cleavage) then cyanogen bromide (INTRAP cleavage) 
35 followed by ion exchange chromatography and trypsin or 
aminopeptidase digestion to remove the connecting 
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peptide residues produces the free Pibrino Peptide B and 

lipo tropin. 

When the product peptide contains an amide 
group instead of carboxylic acid at its C-terminus, the 
5 synthetic method of the invention can be altered and 
extended to produce that amide substituted protein by 
the addition of one or more amino acids to the c- 
terminus. In one embodiment of this method, the DNA 
code for an amino acid, e.g., glycine, can be added to 

10 the C-terminus of the gene for the product peptide. The 
product peptide is expressed with this additional amino 
acid fragment at the C-terminus. Use of the amidating 
enzyme described in the publications A.F. Bradbury, D.G. 
Smyth, Biochem. & Biophvs Res . Com., 15i, 1293-1300 

15 (1988); Kazuhiro Ohsuye et al., Biochem. 6 Biophvs. Res. 
Com. , 150 , 1275-1281 (1988) will cleave the C-terminus 
amino acid between the N and CHj groups so as to leave a 
primary amide group. Other methods based upon C- 
terminus amino acid rearrangement and cleavage are also 

20 known and available for formation of the C-terminus 
amide group. 

The method of the present invention circumvents 
the synthetic and purification difficulties of prior 
marker, antibody and enzyme recombinant techniques for 

25 polypeptide recombinant synthesis. Por instance, the 
first segment of the protein purification construct is 
large enough to reduce any significant effects on the 
translational efficiency that could be caused by the 
variation of the sequence of the variable fused 

30 polypeptide. The use of a low molecular weight ligand 
and a binding protein having a strong binding constant 
for the ligand avoids low concentrations of the binding: 
agent on the column and inefficient separations that are 
attendant with antibody and substrate-enzyme techniques. 

35 A highly leveraged binding capacity is produced for the 
separation as a result of the small size of the ligand 
relative to the large protein purification construct. 



WO92/01707 



PCr/US91/(M511 



29 

The present technique also provides an inexpensive 
affinity ligand (inhibitor) and indefinite reuse of the 
purification column. Further, the' method incorporating 
the preferred synthetic gene and expression vector 
5 allows overproduction of short peptides otherwise 
degraded in bacterial cells or fungi cells. 

The method of the invention preferably employs 
a large DNA segment for the binding protein at the 
expression start site of the vector. With this 
10 preferred arrangement, the host cell reads the 5 '-end 
codons of the recombinant vector as being the same in 
all instances even though the DNA sequence at the 3'-*end 
of the gene can differ. With this preferred 
arrangement, expression is less dependent upon the 
15 variability of sequence of the gene segments for the 
variable fused polypeptide (product peptide) , does not 
undergo reversion because of a repetitive product 
peptide sequence and relies upon the constancy of the 
sequence of the gene segment for the binding protein. 
20 This arrangement results in a more consistent expression 
system for any product peptide. 

In particular, for multiple unit expression, it 
has been discovered that this preferred arrangement 
causes significant recombinant cell tolerance for the 
25 repetitive character of the variable fused protein 

containing more than one copy of a product peptide, and 
the corresponding inserted gene. Therefore, a variable 
fused polypeptide containing multiple units of product 
peptide is effectively and efficiently produced by this 
30 invention. In combination with the secpience variability 
provided by the multiple unit construct described above, 
this preferred arrangement further increases the product 
yield of the recombinant expression according to the 
invention. 

35 The method of the invention also allows the 

synthesis of oligopeptides which cannot be prepared by a 
recombinant technique that incorporates a small 
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antigenic peptide as an antibody separation marker. 
Oligopeptides with less than 50 amino acids are 
typically very unstable in microbes such as coli or 
yeast. The addition of an antigenic peptide marker 
5 containing, for example, eight amino acid units will not 
alter the instability. Hence, oligopeptide 
biosynthesis, which, for example, would include the 
biosynthesis of calcitonin, proinsulin, vasopressin, 
angiotensin, caltrin, LHRH, growth hormone releasing 

10 factor, cecropins, apidaecins and the like, cannot be 
accomplished through application of such a technique, 
The method of the invention also avoids the 
difficulties attendant to techniques for expressing 
large proteins fused to short peptide markers. In these 

15 instances, the large proteins can fold back and block 
the short peptide marker. The marker would then be 
unavailable for complexation with, for example, an 
immobilized antibody. Also, use of a short leader 
peptide marker can cause variations in the overall 

20 expression efficiency, because the translational 

efficiency will become dependent on the DNA sequence of • 
the large protein. 

The method of the invention further employs a 
biospecific, ligand-binding protein affinity 

25 purification that separates large volumes of highly 

concentrated protein purification construct. This has a 
distinct advantage over antibody immobilized affinity 
chromatography methods where the size and molecular 
weight of the antibody are limiting factors for the 

30 amount of antigenic material (e.g. expressed protein) 
that can be complexed with a given unit weight of 
immobilized antibody and a given unit volume of affinity 
resin. In addition, the method of the invention avoids 
antibody leakage, microbial and viral contamination and 

35 other problems associated with inmunoaf finity 
chromatography . 



wo 92/01707 



PCr/US91/04511 



31 

The binding of ligand to binding protein 
according to the invention is highly specific so that 
there is significant discrimination against ^^undesired 
side products. The binding strength of this conjugate 
5 is quite high, on the order of approximately IC'm or 
less as measured by its inverse function, the 
dissociation constant. This strength means that non- 
specific binding of undesired side products can be 
eliminated by treatment with a mild debinding agent in 
10 contrast with what would happen with systems of lower 
binding strengths. Because the binding constant 
operating in the present invention is high, relatively 
powerful washing and/or dissolution systems can be 
adapted to the purification ^technique. For example, 
15 washing with IM aqueous solution of a mineral acid salt 
of a metallic ion, will not dissociate the binding 
protein-ligand couple on the support matrix in all 
instances according to the invention. Because the 
ligand-binding protein couple occupies a small volume 
20 relative to antibody-antigen volumes, however, an 

extremely high separation capacity per unit volume of 
immobilizing medium is possible. 

When the ligand and binding protein couple is 
the especially preferred embodiment, inhibitor-enzyme, 
25 its use, rather than a substrate-enzyme conjugate 

permits achievement of a high binding strength rivaling 
that of the antibody-antigen affinity. Moreover, lack 
of a substrate conjugate negates the drawback of 
substrate based systems wherein the substrate can 
30 undergo reaction upon enzyme conjugation. 

III. Preferred Embodiments 

According to the present invention, a synthetic 
or cDNA gene coding for the product peptide and a DNA 
35 sequence for the interconnecting peptide are cloned 
directly adjacent to the hCAII gene presented in a 
recombinant vector. The resulting recombinant 
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expression system preferably allows overproduction of 
the protein pxirification construct. The protein 
purification construct can then be purified through the 
inhibitor affinity function of the hCAlI moiety. 
5 Generally, any product peptide can be 

overproduced and purified by the method of the 
invention. The method is especially useful for 
overproduction of oligopeptides, which would be unstable 
in a "non- fused" form in the cellular host. 

10 Specific embodiments for practice of the 

invention according to the preferred method employing 
carbonic anhydrase as the binding protein will further 
illustrate the advantages of the present invention. The 
illustrated combinations and compositions for practice 

15 of the method are based upon human carbonic anhydrase 
II. Additional embodiments of the ligand and binding 
protein included within the invention: 

a) dinitrophenol as the ligand and the F^i, of the 
corresponding immunospecif ic monoclonal 

20 antibody; 

b) succinate as the ligand and carboxypeptidase T 

(preferred); and 

c) p-aminobenzoyl glutamate and carboxypeptidase G 
(prefeirred) . 
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III. A. Biologica l Compositions 



DMA and Amino Acid Sequences for 
Protein Purification Construct 

a. Binding Protein Segment 

30 The DNA euid amino acid sequence for the 

especially preferred carbonic anhydrase binding protein, 

human carbonic anhydrase II (hCAII), is given in Figure 

3. When employing the hcAII gene, at least a portion 

representing the functional fragment of the enzyme, such 

35 as but not limited to carbonic anhydrase terminated at 

methionine 239, cysteine 205 or asparagine 231 and 

ceurbonic anhydrase modified as follows: 
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i. hCAII with all or some glutamate amino acid 
residues replaced by a different amino acid 
(AA), preferably aspartate, 
ii. hCAII with all or some arginine AA residues 
5 replaced by a different AA, preferably lysine, 

iii. hCAII with the AA positions at [NllX, G12X] 
(asparagine glycine), [N62X, G623X], [N231X, 
G232X], M240X (methionine), or C205X (cysteine) 
modified as follows: the asparagine is changed 
10 to glutamine or glycine is changed to alanine, 

methionine is changed to alanine or leucine, 
and cysteine is changed to serine, or a 
combination of any of these. 
In particular, the complete ^hCA gene sequence can be 
15 inserted into an expression vector which is compatible 
with Ej^ coli. Cleavage of the DNA sequence at a site 
downstream from the regulatory portion of the vector 
followed by insertion of the gene through blunt- or 
sticky-end ligation forms the recombinant vector. 
20 Additionally, an optional restriction site sequence at 
the 5 ' or 3 ' end of the enzyme gene can be introduced so 
that subsequent manipulation can be made. 

III. A.l. b. Inter- Inner- and Intraconnectlno 
25 Peptide Segments 

A short DNA fragment coding for the inter- 
connecting peptide is inserted near the 3' or 5' end of 
the intact or partial hCA gene (the inner- and 
intraconnecting peptides are discussed below) . This 

30 fragment can be inserted before or after the hCA gene or 
fragment has been transferred to the expression vector. 
The accuracy of the reading frame can be ensured since 
the nucleotide sequence of the hCA gene is exactly 
known. Upon insertion, however, the DNA sequence in the 

35 junction region is verified using standard DNA 

sequencing methods described in Sambrook et al., cited 
supra . 
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Several DNA and peptide sequences for the 
inter-/ inner- and intraconnecting peptides and their 
corresponding cleavage enzymes or chemical cleavage 
conditions are possible. The preferred sequences and 
5 enzymes are given in following Table 3. The gene 
sequence indicated is one possibility coding for the 
inter-, inner- and intraconnecting peptides. Other DNA 
sequences can be constructed to code for the same inter- 
, inner- or intraconnecting peptide sequence. 



10 
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15 



Enzymes 

for Cleavage 

Enterokinase 

Factor Xa 

Thrombin 

Ubiguitin Cleaving 
Enzyme 



Renin 

T6CTGGTTTAT 
20 Trypsin 

Chymotrypsin 

25 Clostripain 
S. aureus V8 



35 
Table 3 

Inter-, Inner- or 

Intraconnecting 

Peptide 

(Asp)4Lys 

IleGluGlyArg 
ArgGlyProArg 
ArgGlyGly 

HisProPheHisLeu- 
LeuValTyr 

Lys or Arg 

Phe or Tyr or Trp 

Arg 
Glu 



DNA Sea, 
GACGACGACGATAAA 
ATTGAAGGAAGA 
AGAGGACCAAGA 
A6AG6AGGA 

CATCCTTTTCATC- 



AAA OR C6T 

TTT or TAT or 
TG6 

CGT 
GAA 



30 



35 



40 



45 



50 



Chemical 
Cleavage 

(at pH3) 

( Hydroxylamine ) 

(CNBr) 

BNPS-skatole 

2-Nitro-5- 
thiocyanobenzoate 



Inter-, Inner- or 
Intraconnecting 

Peptide 

AspGly or AspPro 

AsnGly 

Methionine 

Trp 

Cys 



DNA Seg, 
GATGGA 
AATCCA 
ATG 
TGG 
TGT 



III. A.l c. Variable Fused Peptide Segment 

The third part of the sequence consists of a 
DNA sequence coding for the variable fused polypeptide 
which in a preferred embodiment is a single product 
peptide or multiple units of product peptide. The unit 
can in turn contain one or two product peptides. To 
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form the single product peptide, the single copy gene is 
ligated to a DNA segment for an interconnecting peptide 
selected from the foregoing list. To form the double 
product peptide incorporated into multiple units, the 
5 combination of DNA sequences for the two product 

peptides, which are linked together by a DNA sequence 
for the innerconnecting peptide, is linked to a DNA 
sequence for an intraconnecting peptide selected from 
the foregoing list. Through repetitive cloning, 
10 multiple units of the gene can be combined together. 
This multiple unit gene then constitutes the code for 
varieible fused peptide segment. 

In addition to polypeptides formed from amino 
acid units alone, polypeptides that undergo post- 
15 trans lational modification may also be produced by the 
method of the present invention. Exan5>les include 
glycoproteins, lipoproteins, N- terminal acetylated 
proteins, and the like which can be produced by host 
cells that naturally accomplish such post-translational 
20 modifications. Polypeptides containing amino acids that 
cannot be handled by cellular biosynthetic pathways, 
e.g. D-amino acids and synthetic amino acids, usually 
cannot be produced by the method of this invention 
unless specific genetic alteration of the cellular 
25 system is made to account for them. 

Preferred single copy or multiple iinit 
embodiments of the variable fused polypeptide include 
the sequences for the preferred embodiments of product 
peptides mentioned in the general section above. The 
30 peptide sequences for some of these polypeptides are 
known as indicated in Table 4. The disclosures of the 
references cited in Table 4 are incorporated herein by 
reference. 



35 
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Table 4 

Peptide Sequences 

1. Caltrin: 

Lewis, R.V., San Agustin, J,, Kruggel, W., and 
5 Lardy, H^A. (1985) Proc. Natl> Acad> Sci. USA 

82, 6490-6491; 

2. Calcitonin (3 sequences): 

Niall, H.D., Keutmann, H.T., Copp, D.H., and 
Potts, J.T. Jr., (1969) Proc. Natl, Acad, Sci. 

10 USA 64, 771-778; Keutmann, H.T., Lequin, R.M., 

Habener, J.F., Singer, F.R., Niall, H.D., and 
Potts, J.T. Jr., (1971) Endocrinology > 
Proceedings of the Third International 
Symposium ^ Taylor, ^S., ed., p. 316-323, 

15 Heinemann Medical Books, London; 

3. Vasopressin: 

Sausville, E., Carney, D., and Battery, J. 
(1985) J. Biol. Chem. 260, 10236-10241; 

4. Oxytocin: 

20 Pierce, J.G., Gordon, S., and du Vigneaud, V. 

(1952) J. Biol. Chem, 199, 929-940; 

5. Leukemia Inhibitor factor: 

Gearing, D.P., King, J. A., and Gough, N.H. 
(1988) Nucl. Acids Res. 16, 9857; 
25 6. LH-RH has the following sequence: 

Glu His Trp Ser Tyr Gly Leu Arg Pro Gly; See 
Metzler D.E. in Biochemistry (1977), Academic 
Press, London. 

7. ACTH (adrenocorticotropin, human) has the following 
30 sequence: 

Ser Tyr Ser Met Glu His Phe Arg Trp Gly Lys Pro 
Val Gly Lys Lys Arg Arg Pro Val Lys Val Tyr Pro 
Asn Gly Ala Glu Asp Glu Ser Ala Glu Ala Phe Pro 
Leu Glu Phe; See Metzler D.E. in Biochemistry 
35 (1977), Academic Press, London. 

8. Growth hormone release factor: 
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TryAlaAspAlallePheThrAsnSerTyrArgLysValLeuGly 
GlnLeuSerAlaArgLysLeuLeuGlnAspileMetSerArgGlnGl ; 
n Gly61uSerAsn61nGluArgGlyAlaArgAlaArgLeu-llH2 
(J. River et al.. Nature , 300, 276 (1982). 
5 9. insulin-like Growth Factor I (IGFI) (bovine), 

10. Insulin-like Growth Factor II (bovine): 

Honegger, A., and Humbelr R.E, (1986 3. Biol. 
Cham. 261, 569-575; 

11. Epidermal Growth Factor (Urogastrone) : 

IQ Gregory, H., and Preston, B.M. (1977) Int. J. 

Peot. Protein Res. 9, 107-118; 

12. Cecropins Attacins: 

Jayne, S.J.M., Xanthopoulos , K.G., Destefano- 
Beltran, L., and Dodds, J.H. , (1987) BioEssays 
15 6, 2630270; 

13. Angiotensin II: 

Asp Arg Val Tyr lie His Pro Phe; See Metzler 
D.E. in Biochemistry (1977), Academic Press, 
London. 

20 14. Substance P: 

Arg Pro Lys Pro Gin Gin Phe Phe Gly Leu Met; 
See Alberts B., Bray D., Lewis J., Raff M., 
Roberts K., and Watson J.D. in Molecular 
BiQloov of the Cell . 2nd ed. (1989). 

25 15. Cholecystokinin-8 : 

Asp Tyr Met Gly Trp Met Asp Phe; See Alberts 
B., Bray D., Lewis J., Raff M., Roberts K., 
Watson J.D. in Molecular Biology of the Cell, 
2nd ed. (1989). 

30 

III. A. 2. Expression Vector Incorporating 
Recombinant Gene 

The final recombinant expression vector 

incorporating the protein purification construct gene is 

35 compatible with the host cell. The vectors used will 

have many features in common. These features include an 

origin of replication compatible witK the host cell, 

regulatory DMA sequences for transcription and 
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regulation of transcription (for inducible systems), an 
efficient ribosomal binding site (for prokaryotic 
hosts), a poly-A signal (for eukaryotic hosts). In 
addition, phenotype genes, regulatory regions and leader 
5 sequences can be included. 

The prokaryotic vectors such as those for 
expression in Escherichia coli are characterized by an 
origin of replication, a genetic marker (phenotype) for 
selection of transformed bacteria, and DNA regulation 

10 sequences that will direct the expression of the gene of 
interest. The regulation sequences typically will 
include a promoter (P) to drive the transcription, an 
operator (0) to control transcription (on/off switch), 
an efficient ribosome bindijig site (RBS) to start 

15 translation, and a transcription termination signal. 
The start and stop codons are provided by the inserted 
(protein purification construct) gene. 

The prokaryotic vector in particular contains a 
suitable "expression cassette" which is based upon any 

20 of the available promoter /operator systems. Typical 
promoters for inclusion in the prokaryotic vector 
include lactose, tryptophan, T7, lipoprotein, alkaline 
phosphatase, lambda leftward or rightward promoter or a 
combination of these (hybrid promoters). The lactose 

25 and tryptophan operators, as well as temperature 

sensitive lambda promoters are typical on/of f switches 
that can be included in the prokaryotic vector. Typical 
phenotypic markers for inclusion in the prokaryotic 
vector include genes for development of resistance to 

30 ampicillin, tetracycline, kanamycin, chloramphenicol. 

The preferred two-plasmid prokaryotic system 
contains the T7 promoter to drive the expression of the 
gene of interest (protein purification construct gene) . 
The second plasmid provides a T7 lysozyme protein for 

35 the tight control of expression. The T7 RNA polymerase 
gene, under the control of any of the foregoing 
promoter /operator combinations, can be located on a low 
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copy number plasmid, integrated in the chromosome or 
supplied by a lambda bacteriophage upon infection. The ' 
second plasmid preferably also carries a different 
genetic marker (i.e., a different phenotypic marker). 
5 In an alternative embodiment, expression in 

Bacillus subtilis can be accomplished by practice of a 
standard procedure such as those described in Ganesan, 

and Hoch, J-A. ed, , Genetics and Biotechnology of 
Bacilli Vol. 2, (1987) Academic Press; Winnacker, E.L., 
10 From Genes to Clones (1987) 209-222, VCH publishers; and 
Hardy, K.G. and Glover, D.M., ed. , DNA Cloning (1986) 
Vol. 2, 1-18, IRL Press Ltd., Oxford, David B. Goeded, 
Methods In Enzvmoloav . 185 , American Press, (1990) the 
disclosures of which are incorporated herein by 

15 reference. 

The eukaacyotic vectors such as those for 
expression in the yeast, Saccharomvces cereyisiae, 
typically are shuttle vectors which contain an origin of 
replication for coli and one for Cereyisiae, a 

20 genetic marker for both cell types, and DNA regulation 
sequences that will direct the expression in yeast. The 
regulation sequences typically include a promoter, a 
regulatory sequence, and a transcription termination 
signal (including a polyadenylation signal). Optional 

25 signal sequences for direction of cellular secretion can 
also be inserted into the eukaryotic vector. 

Typical markers to be incorporated provide 
positive selection by complementation of mutations in 
the genes necessary for production of uracil, leucine, 

30 histidine, adenine, tryptophan and the like. Promoter 
sequences which preferably can be incorporated into the 
eukaryotic vector include alcohol dehydrogenase I or II, 
glyceraldehyde phosphate dehydrogenase, 
phosphoglycerokinase, galactose, tryptophan, mating 

35 factor alpha and the like. 

Expression systems for insect cells include the 
BEVS system as described in U.S. Patent No. 4,745,051 



» 
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the disclosure of which Is incorporated herein by 
reference . 

IXI. A. 3. Transformed Microbe 

5 The microbes used for trans feet ion and 

transformation with the expression vectors can be 
prokaryotic or eukaryotic cells based on unicellular or 
multicellular organisms such as those from mammalian or 
insect cells. Preferably the unicellular or simple 

10 multicellular organisms can be used as a basis for the 
appropriate host cells. Preferred host cells include 
E. coli and Saccharomvces cerevisiae as well as 
Spodoptera f rupiperda . 

A preferred embodiment, coli strain 

15 BL21(DE3) pLysS, can be used for the two-plasmid 
prokaryotic system. Other Ej_ coli strains are also 
available and incorporate the characteristics for 
expression according to the invention. Yeast 
Saccharomvces cerevisiae deficient in the production of 

20 several proteases is a preferred embodiment of a 
eukaryotic host for overproduction of the protein 
purification construct. Temperature sensitive yeast 
mutants are available and are included as preferred 
eukaryotic embodiments. 

25 

III. B. Methods for Performing Expression Steps 

Recombinant vectors are constructed using 
standard procedures and protocols for restriction, 
30 insertion, ligation, transformation, screening and 

analysis as described in Sambrook et al., cited supra , 
the disclosure of which is incorporated herein by 
reference. 

35 III. B.l Methods for Forming Recombinant Vectors 
The recombinant vector may be formed by 
restriction of the base vector with a restriction enzyme 
that is specific for the appropriate restriction site; 
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followed by insertion and ligation of schemes for 
ultimate construction of the protein purification 
construct gene. As characterized above, procedures for 
the restriction^ insertion and ligation are found in 
5 Sambrook et al., cited supra, 

III. B.2 Methods for Multi-unit Gene 

Two strategies can be followed for the 
construction of a multiple unit gene for the variable 

10 fused peptide. The first approach involves ligating all 
the individual gene units and the DNA sequences for the 
intraconnecting peptides together into an intermediate 
multi-unit gene . This intermediate multi-unit gene is 
then transferred to the fusion expression vector by 

15 ligating it directly adjacent to the DNA sequence coding 
for the interconnecting peptide. The second approach 
involves the sequential ligation of each individual gene 
unit and the DNA sequence for the intraconnecting 
peptide into the fusion expression vector. After one 

20 such gene unit plus DNA sequence has been inserted, the 
second one is ligated directly adjacent to the first 
one, then the ligation is repeated. In both cases, an 
individual gene unit for .one or more a product peptides 
and the corresponding innercoimecting peptides forming 

25 the unit until all units and sequences for 
intraconnecting peptides are inserted. 

III. B.3 Methods for Transforming Host Cells 

The transformation procedures fall into several 

30 categories depending upon the type of cell to be 

transformed. Preferred embodiments for transformation 
of E. Coli and S^. cerevisiae cells include the Mandel, 
Hanahan and Rothstein protocols. Briefly, the cells are 
grown to a certain density, at which point they are 

35 treated with a solution to render the cell wall or 

membrane permeable. These cells are then mixed with the 
recombinant DNA. Opon uptake of the recombinant vectors 
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and recuperation of the cells, transformed cells are 
screened for the desired characteristics through the use 
of selection medium. 

5 III. B.4 Method for Control of Expression 

The method for expression typically depends on 
the host transformed with the vector. In one 
embodiment, induced expression can be practiced by the 
addition of isopropyl thiogalactoside (IPTG) to the 

10 bacterial culture such as coli. Cells are grown to a 
desired cell density and then IPTG or lactose is added 
to induce transcription. Other expression control 
mechanisms can be practiced through temperature 
regulation, nutrient stainra^ion or indole acrylic acid 

15 influence. For example, the two-plasmid system can be 
modified to be responsive to temperature, nutrient 
starvation or indole acrylic acid (lAA) induction of 
expression. For expression in yeast, identity of carbon 
source in the growth medium or the incubation 

20 temperature can function as a control factor to induce 
expression. In a preferred embodiment, expression can 
be induced by recombining the vector with an appropriate 
temperature sensitive promoter and by either lowering or 
increasing the incubation temperature. Single- or two- 

25 plasmid systems can be regulated in this fashion. 
Alternatively, temperature control of plasmid copy 
number may also produce large amounts of the fusion 
protein through the use of runaway replicons. 

The use of an inducible, as opposed to 

30 constitutive, expression is important for the purposes 
of the present invention. hCA is highly toxic to E. 
coli yeast and many other host cells. Establishment of 
optimum culture density before the protein purification 
construct expression is triggered minimizes the chance 

35 that intra-cellular accumulation of the expressed 

protein purification construct will harm the production 
yield. 
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III. B,5 Method for Culturation 

Following transformation and selection , the 
cells are cultured to produce optimum quantities of 
5 protein purification construct. To grow, a mature 

culture r a volume of suitable medium is inoculated with 
a single colony of the recombinant clone, grown in 
suitable growth mediiim and under appropriate temperature 
conditions until the cell culture reads the desired 
10 density. 

III. B.6 Method for Separating 

The high affinity of hCAII for sulfanilamide 
enables a rapid, efficient purification of the protein 

15 purification construct. Peptide fusions to the ends of 
hCAII do not affect the ability of the enzyme to bind to 
the substrate. The enzymatic reaction site is remote 
from the chain ends of the enzyme. To perfoinn the 
binding separation, the culture medium is combined with 

20 0.05M Tris sulfate to adjust the pH to 7.6, and passed 
through a column of Sepharose covalently bonded to 
sulfanilamide. The Sepharose-sulfanilamide bonding is 
accomplished by a Gly-Tyr linker group e.g. Sepharose- 
Gly-Tyr-sulf anilamide . 

25 

III. B.7 Method for Specific Proteins 

The purification method for the following 
protein purification construct example utilizes similar 
steps for any variable fused polypeptide. The recovery 

30 of the desired protein, however, will differ depending 
upon the interconnecting peptide and cleavage method 
used. Cleavage will depend on the particular 
interconnecting peptide and its amino acid sequence as 
well upon the fact that the product peptide should not 

35 contain the amino acid sequence of the interconnecting 
peptide . 
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IV. Protocols > Preparations and Examples 

The following protocols, prepisurations and 
examples further illustrate particular aspects of the 
invention. They are not meant as limitations of the 
5 scope of the invention which is fully set forth above. 

IV. A. Synthetic Gene Protocol 

1. P eptide and DNA Sequences for 
10 the Product Peptides 

Table 3, above, provides the peptide and DNA 

sequences for the preferred embodiments of the product 

peptide. The oligonucleotides representing these DNA 

sequences can be synthesized by automated techniques 

15 described generally by S.L.^ Beaucage and U.H. Caruthers, 
Tet. Letters , 221 , 859-62 (1981), the disclosure of 
which is incorporated herein by reference. Typically, 
a Vega Coder 300 DNA synthesizer or a ABI model 3 8 OB can 
be used. Alternatively, native DNA coding for the 

20 desired protein Ccui be isolated by known purification 

techniques described in Sambrook et al . , cited supra the 
disclosure of which is incorporated herein by reference. 

IV. B. Protocols for Insertion, Transformation 
25 Procedures for methods to restrict, ligate, 

trans fozm, select, culture, and lyse according to the 
invention, generally follow standard methods known in 
the art. Literature providing the details for these 
methods include, Sambrook et al., cited supra . The 
30 textual material of each of these references is 
incorporated herein by reference. 

In a preferred embodiment DNA sequences can be 
ligated together according to the procedure described in 
Sambrook et al., cited supra . 



35 



IV. B. 1. Restriction Protocol 



Digesting DNA with Restriction 
Endonucleases 
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Typically, 0.5 to 2 ug of plasmid DNA is 
digested in 20 ul of a IX restriction buffer with 1-20 
units of restriction enzyme. The reaction mix is 
incubated for 1 to 16 hours at the temperature 
5 recommended by the enzyme supplier. The reaction mix is 
then either size fractionated by gel electrophoresis or 
the DNA is further purified by standard procedures as 
described in Sambrook et al.^ cited supra . 

10 IV. B. 2. Insertion and Ligation Protocol 

Typically, the cloning vector is digested with 
the appropriate restriction enzymes as described in the 
foregoing restriction protocol. The linearized vector 
can then be dephosphorylated with calf intestinal 

15 phosphatase (CIP) or bacterial alkaline phosphatase 
(BAP) under conditions known to those of skill in the 
art, e.g., as suggested by the supplier. The DNA is 
then further purified by standard procedures (see 
Sambrook et al., cited supra), which usually involve a 

20 phenol extraction and ethanol precipitation. 

The DNA segment to be inserted is then mixed in 
a 3-5 fold (for large fragments) or 20-30 fold (for 
short oligonucleotides) molar excess with this precut 
cloning vector. The ligation is perfoirmed in a IX 

25 ligation buffer (20 mM Tris pH 7.6, 10 mM MgClz, 0.4 mM 
beta-mercaptoethanol, 0.4-1.0 mM ATP), in the presence 
of T4 DNA ligase, at 16 *C for 16 hours. An aliquot of 
the reformed vector is then used to transform competent 
E, coli cells and select for recombinant plasmids. 

30 

IV. B. 3. Transformation Protocol for E. Coli 

In a preferred embodiment, the recombinant 
vector can be taken up by Ej_ coli according to the 
calcium chloride procedure given below. The alternative 
35 transformation protocol can also be used- 



Bacterial Transformation by the 



Calcium Chloride Procedure (Mandel 
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Protocol ) 

This procedure will transform (prokaryotic) 
bacteria with plasmid DNA (after Handel and Higa 1970). 
After inoculating Luria broth (described by 
5 Sambrook et al., cited supra ) with a bacterial culture, 
the cells are grown with agitation at optimum 
temperature to a density of about 5 X 10^ to 10^ 
cells /ml. The culture is chilled to about 0*C, 
centrifuged and the cells collected. The cells are then 
10 resuspended in an ice-cold, sterile solution of 50 mM 
CaClj and 10 mM Tris-Cl (pH 8.0). This centrifuge and 
resuspension step is repeated one more time. The result 
is a concentrated suspension of treated cells that are 
ready to accept the new vector. 

15 For maximiim transformation efficiency, the 

bacterial culture preferably is in a logarithmic phase 
of growth; the cell density preferably is low at the 
time of treatment with calcium chloride; and the treated 
cells preferably are maintained at 4*C for 12-24 hours. 

20 To take up the vector, an aliquot of the 

ligation reaction is added to the suspension of treated 
cells. The combination is mixed and stored on ice for a 
short time. Up to 40 ng of DNA (dissolved in up to 100 
^1 of ligation buffer or T£) can be used for each 

25 transformation reaction. 

Next, the transformed cells in culture tubes 
are transferred to a 42 *C water bath for 2 min. An 
aliquot of L broth is added to each tube and the cells 
incubated at 37 'c for about 30 minutes (tetracycline 

30 selection) or 1 hour (ampicillin or kanamycin 

selection) . This period allows the bacteria to recover 
and to begin to express antibiotic resistance. The 
cells are spread onto selective media and are incubated 
at optimum temperature. Colonies will appear overnight. 

35 (Adapted from Sambrook et al., cited supra.) 



b. Alternate Transformation Protocol 
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Several colonies can be removed from a freshly 
streaked SOB agar plate (i.e., Luria broth supplemented 
with magnesium chloride, -sulfate and potassium 
chloride) and dispersed in SOB medium. The cells are 
5 inoculated into an Erlenmeyer flask containing SOB 
medium and incubated at optimum temperature with 
moderate agitation until the cell density is about 4-7 X. 
10^ viable cells/ml. The cultxire is then collected and 
chilled on ice for a short time. The cells are pelleted 

10 by centrifugation at 750 - lOOOg at ice temperature. 
The cells are resuspended in culture volume of 
transformation buffer (TFB) and incubated on ice for a 
short time. The cells are repelleted and drained 
thoroughly as before, then ^resuspended in TFB. 

15 To adapt these cells to accept transformation, 

a DMSO and dithiothreitol DTT solution (DMSO and DTT 
(DnD)) is added to the cells. The cells are then 
incubated on ice for a short time. A second, equal 
aliquot of DMSO and DTT (DnD) as above is added to give 

20 about a 7% final concentration. The cells are incxibated 
on ice for 10-20 minutes. 

To obtain transformation, chilled cells and the 
vector DNA solution are combined in a volume of less 
than 20 All. The cells are incubated on ice for a short 

25 time, then heat shocked by placing them in a water bath 
at about 42 *C for 90 seconds. These cells are returned 
onto ice to quench the heat shock, allowing 2 minutes 
for cooling. SOC SOB with glucose medium is added to 
each culture and the cultures incubated at about 37'c 

30 with moderate agitation for 30-60 minutes. The cells 
are then spread on agar plates containing appropriate 
additives to select transformants. 

IV. B. 4. Transformation Procedure for Yeast 
35 Recombinant plasmids are constructed using E^. 

coli/ S. cerevisiae shuttle vectors. These plasmids are 
capable of replicating in both hosts. An aliquot of the 
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ligation mixture is used to first transform an coli 
host. Recombinant clones are analyzed (using standard 
procedures described in Sambrook et al . , cited supra 1 
and positive ones identified. Plasmid DJUh is isolated 
5 from these clones (according to procedures described in 
Sambrook et al., cited supra ) and used to transform the 
yeast host. Several transformation procedures exist for 
yeast. The two mainly used are the spheroplast- and 
lithium method, and are described below. Both are 
10 adapted from Rothstein, R., DNA Cloning (Glover DM. Ed) 
vol. 2, pp. 45-66, IRL Press Ltd., Oxford (1986). 

Yeast cells are grown to a density of 1-2 X 
10^/ml in yeast extract, peptone, dextrose (YPD) liquid 
medium. The cells are washisd twice with sorbitol 
15 solution by repeated pelleting followed by cell 

resuspension and a final resuspension in sorbitol and 
mercaptoethanol . 

A diluted aliquot is plated on YPD solid 
medium. This will serve as the control to calculate the 
20 percentage of killing after spheroplast formation. 

Glusulase enzyme (Endo Laboratories, Inc.) is added to 
the final suspension and incubation is conducted at 30 *C 
with gentle shaking for abput 40 minutes. An aliquot is 
taken again on YPD medium to measure the percentage 
25 survival after spheroplast formation. 

The cells are prepared for transformation as 
follows. The spheroplasts are pelleted and then gently 
resuspended in sorbitol solution. This procedure is 
repeated several times followed by resuspension of the 
30 spheroplasts in sorbitol solution. Tris-HCl pH 7.4 and 
0.1 M CaClj solution are then added. The spheroplasts 
are pelleted and resuspended in sorbitol solution, 10 mM 
CaClj • 

To transfer the spheroplasts, aliquots are 
35 distributed and vector DNA (1-10 }ig) added while keeping 
the total volume of DNA added to less than 20 ^il. The 
cells and DNA are incubated for a short time at room 
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temperature, then resuspended and combined with 2 ml of 
50% (w/v) polyethylene glycol mixed with 0.2 ml of 0.1 Mr 
Tris-HClr pH 7.4 and 0.1 M CaClj. 'After 10 minutes at 
room temperature r the spheroplasts are pelleted, 
5 resuspended in sorbitol solution, 10 mM Tris-HCl, pH 7.4 
and 10 mM CaCla. The spheroplasts are plated in 
regeneration agar (maintained at 45-50*C) onto the 
appropriate omission medium. 

10 IV. B. 5. Alternative Lithium Transformation 

Procedure for Yeast 

A culture of yeast is grown in YPD medium. The: 

cells are pelleted and then resuspended in tris EDTA 

(TE). The resuspended cells are treated with 0.1 M LiCl 

15 (or 0.3 M LiOAc) in TE and incubated at 4*C. 

To transform the cells, 200 /il of cells per . 
transformation reaction are combined with 1-10 ^1 of 
vector DNA, keeping the volume of DNA added to less than 
1/10 the total volume. The cell-DMA mixture is 

20 incubated for 30 minutes at room temperature. The cells 
are resuspended in medium and then 1.5mlof40% 
polyethylene glycol is added. The mixture is incubated 
at room temperature. The cells are heat shocked at 42 *C 
for 5 minutes, and then centrifuged. The cell pellets 

25 are washed with water by repeatedly centrifuging and 

gently resuspending- Finally, the cells are resuspended 
in water and plated on appropriate omission medium. 
Adapted from Roths tein, cited supra . 

30 IV. C. Selection Protocol 

In a preferred embodiment, transformed clones 
can be selected by their phenotype (antibiotic 
resistance) by plating on selective medium containing 
appropriate amounts of antibiotics. 

35 Transformed Ej^ coli are selected through the 

use of plates containing the appropriate antibiotic 
(i.e., the one to which resistance is conferred by the 
introduced plasmid) . Typical final concentrations are: 
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amplcillin at 100 ug/ml, chloramphenicol at 10 ug/ml, 
kanamycin at 50 ug/ml, streptomycin at 25 ug/ml 
tetracycline at 15 ug/ml. When using E^. coli 
BL21(DE3)pLysS as the host, trans formants are plated out 
5 on medium containing both ampicillin and chloramphenicol 
( at the above concentrations ) • 

Transformed Sj^ cerevisiae cells are selected 
through the use of nutrient omission plates (described 
by Rothstein, cited supra) lacking uracil, tryptophan, 
10 leucine, histidine or adenine (depending on the plasmid 
and host that is used). 

IV. D. Cttlturing Protocol 

In a preferred embodiment, the method for 

15 culturing cells can be practiced as described in 
Sambrook et al., cited supra > Briefly, the method 
entails transferring a single bacterial colony to a 
small volume (3-5 ml) of bacterial growth medium (such 
as Luria Broth) containing an appropriate antibiotic. 

20 The culture is incubated at 37 *C (or other appropriate 
temperature) and scaled up to larger volumes. Yeast 
cells are cultured similarly, using yeast omission 
medium and incubating the cells at 30 *C. 

25 IV. E. Lysing Protocol 

In one embodiment, cells can be lysed with 
lysozyme according to the protocol as follows: 

1. Lyse the cells in l/lOth the volume of the 
original culture of SOmMTris (pH 7.9)-l/2mM 

30 EDTA containing Img/ml lysozyme. Keep on ice 

for 60 minutes. 

2. To the now viscous suspension, add: MgCl2 to a 
final concentration of lOmM, RNAase to 1-2 
ug/ml, and DNAase to 5 ug/ml. Incubate on ice 

35 for 30-45 minutes, (until the solution is no 

longer viscous ) . 
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3, Spin at 4 C/10000 rpm (SS34 rotor)/5 minutes. 
Save the supernatant. 

In another embodiment the cells can be lysed by 
a single freeze-thaw cycle provided that the BL21(DE3) 
5 pLysS host is used or any other host containing the 
pLysS plasmid. 

IV. F. Liaand Immobilized Affinity Separation 
Protocols and Preparations 

10 An aspect of the purification sequence for the 

protein purification construct involves immobilization 
of sulfanilamide when hCAII is the affinity binding 
agent. In this case, three kinds of sulfonamide 
affinity substrates, three linking methods, two matrices 

15 and two ways of eluting the hCA are exemplified. These 
protocols are given below, however, there are many 
possibilities for combinations of immobilization matrix, 
substrate and linker. 

The first method uses a polyamide matrix, 1- 

20 ethyl-3(3 dimethylaminopropyl ) carbodiimide acid as the 
linking agent and p-amino methyl benzene sulfonamide as 
the affinity substrate. Elution of the bound protein is 
accomplished with NaNa. 

25 IV. F. 1. Protocol A 

Formation of Immobilized Sulfanilamide 
a. Materials 
a. A polyamide gel such as CM BioGel A (from 
BioRad, Inc - ) 

30 b. l-ethyl-3{3 dimethylaminopropyl) carbodiimide 

c. p-AminoMethyl Benzene Sulfonamide (pAMBS) 

d. Acetone, Carbonate, Bicarbonate, Tris. KjSO^ 
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b. Coupling Reaction 
The reaction to couple sulf aniiamide to the 
solid support may be conducted in the following manner. 
The gel is washed several time with H2O, then with 50% 
5 Acetone-water. The washed gel is placed in a 1-liter 
flask and 50% acetone-water is added to form a slurry. 
pAKBS (3g) is dissolved in 50% Acetone-H20 at pH 4.8 and 
the solution is added to the slurry to form a component 
mixture • 

10 To mobilize the pAMBS, EDAC (5g) is dissolved 

in H2O to form an EDAC solution just prior to the 
coupling reaction. The EDAC solution is added dropwise 
to the component mixture at room temperature while 
maintaining the pH at 4.8 by. HCl addition. After all 

15 EDAC is added the pH is kept at pH 4.8 for about 6-8 
hours, the reaction is stirred at ambient temperature 
and allowed to proceed for a total of 20 hours. 

c. Washing 

20 The coupled gel is washed in a fretted disk 

funnel with the following sequence of solutions. 

a. 150 ml 50% Acetone-water; 

b. 1 liter water; 

c. 1.5 liter hot Carbonate-Bicarbonate (1:1); 
25 d. 150 ml 50% Acetone-water; 

e. 700 ml Acetone; 

f • 2 liters water; and 

g. 1 liter of 0.1 M Tris, 0.2M K2SO4 at pH 9.0. 

30 d. Gel Storage 

The gel is kept in Tris buffer at pH about 7.5 
with a trace of azide to prevent bacterial growth. 

The second method employs a Sepharose matrix, 
to which p-aminobenzenesulfanilamide is coupled via a 
35 glycyl-L- tyrosine spacer arm. The bound hCA (or protein 
purification construct) is eluted with potassium 
thiocyanate . 
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IV. F. 2. Protocol B 

Sepharose-Gly-Tyx may be prepared from 
Sepharose 4 B freshly activated with cyanogen bromide 
5 (180 mg/ml packed Sepharose) by coupling with Gly-Tyr 
(5mg/ml packed Sepharose) in 0.1 M-sodium bicarbonate, 
pH 9.5- The affinity gel is obtained by diazotization 
of p-aminobenzenesulfanilamide and coupling of this 
compound to the Sepharose-Gly-Tyr . p- 

10 aminobenzenesulfanilamide (0.3g) is suspended in 120 ml 
of ice-cold 1 M-HCl, and to the suspension is added 0.9 
g of sodium nitrite in 45 ml ice-cold water. After 10 
minutes of reaction the diazotized benzenesulfanilamide 
is poured into a 600 ml suspension, containing 300 ml 

15 packed Sepharose-Gly-Tyr in 0.2 M-sodium bicarbonate pH 
8.8. The pH is adjusted to 9.5 with 1 M-NaOH and, after 
gentle stirring for 3 hours at room temperature, the 
coupled red Sepharose derivative (SA-Sepharose) can be 
washed with 10 liters of water and 2 liters of 0.05 

20 Tris-sulfate pH 7.6. 

IV. F. 3. Protocol C 

The third method uses the same Sepharose matrix: 
described above, except that the ligand is p- 
25 carboxybenzenesulfanilamide and it is coupled to the 
resin with ethylenediamine . 

The Sepharose-ethylamine derivative may be 
prepared by adding 115 ml of an aqueous solution 
containing 15 ml ethylene diamine, adjusted to pH 10 to 
30 100 ml freshly activated packed Sepharose. The affinity 
gel is prepared by coupling p- ' 

carboxybenzenesulfanilamide to the Sepharose ethylamine 
by the carbodiimide method. One hundred milliliters of 
wet packed Sepharose was suspended in 400 ml 
35 dimethyl formamide containing 1 g of p- 

carbo^benzenesulfanilamide. A portion of EDC (5 g in 
30 ml water pH 4.7) is added slowly, and the pH 
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maintained at pH 4.7. After 20 hours of reaction at 
room temperature the affinity gel, Sepharo8e-*ethyl(p- 
carboxybenzeneBulfanilamide) carboxamide (SE-Sepharose) 
can be washed with dimethyl formamide and a large volume 
5 of water* 



IV. F, 4. Separation Procedure 

To separate under Protocols A, B and the 
following procedure can be used. The lysate from the 

10 cell culture is first pumped through a column with the 
SA-Sepharose equilibrated with 0.05 M-Tris -sulfate pH 
7.5. After washing with 0.05 M-Tris-sulf ate-lM-sodium 
sulfate pH 7.5 to remove other material, the protein 
purification construct (containing carbonic anhydrase), 

15 can be eluted with 0.2 M-potassium thiocyanate in 0.05 
H-Tris-sulfate pH 6.5. 



IV. 6. Protocols for Cleaving 

The methods for cleaving the inter, intra and 
20 innerconnecting peptide depend upon the amino acid 
sequence of the inter, intra and innerconnecting 
peptide • 

1. Protocol A 
25 Enterokinase cleavage (AspaLvs sequenced 

The fusion protein eluted from the column is 
made 10 mM in Tris (pH 8.0). Bovine enterokinase is 
prepared as a 100 ug/ml lOmH Tris (pH 8.0) stock 
solution. An appropriate aliquot is added to the fusion 
30 protein solution (10% by weight) and incubated at 37 *C 
for 15 hours. 



2. Protocol B 

Factor Xa ( IleGluGlvArg^ 
35 Cleavage can be performed on ice by combining 

enzyme and substrate at a ratio of 1:200 (wt/wt) and 
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allowing the reaction to proceed for 12 hours in SOinM 
Tris (pH 8.0)/l00 mM NaCl. 

3. Protocol C 

5 Thrombin f AroGlvProArg or resembling 

Structures like AraAlaProLvs ) 

Thrombin cleavage can be performed while the - 

fusion protein is still attached to the affinity resin. 

Thrombin is then added at a concentration of 1% (w/w 

10 fusion protein) and incubated for 1 hour at 37 C. 

4, Protocol D 

Collaaenase 

Purified collagenase of Achromobacter iophaqus 
15 is dissolved in 10 mM Tris/ 0.25 M NaCl/ 10 mM CaCla/ 10 
mM 2-mercaptoethanol, final pH 7.4 at a concentration of 
1 mg/ml. Ten mg of protein purification construct 
protein at a concentration of .5-3 mg/ml is digested 
with 10 ul of the collagenase stock solution for 10 min. 

20 at 30^*0. 

Consensus collagenase cleavage site: 
-Pro-Xxx-Gly-Pro-Yyy 

4. Protocol E 

25 Ubiouitin cleaving protei n (AraGlvGlv) 

Cleavage reactions can be performed by 
combination of the enzyme and substrate at an 
appropriate ratio allowing the reaction to proceed for 
12 hours. 

30 

5 . Protocol F 

CNBr (Methionine) Chemical Cleavage 
The protein is dissolved at a concentration of 
10-20 mg/ml in 70% formic acid, CNBr is added to a final 
35 concentration of 2 mg reagent per mg polypeptide, and 
the mixture is incubated for 18 - 20 hours at ambient 
temperatures. This method is limited to desired 
proteins that are stable in 70% formic acid and do not 
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contain methionine residues within their amino acid 
sequences • 

6. Protocol G 

5 Hydroxvlamine (AsnGly) 

A solution of 2M hydroxylamine hydrochloride 
and .2M trisbase is titrated to pH 9.0 at 45 'c. The 
fusion protein is dissolved in this solution at 45 *C for 
4 hours. The reaction is terminated by lowering the 
10 temperature degree and adjusting the pH to below 8.0. 

7. Protocol H 

Chemical Cleavage at Cysteine Residues 
The fusion protein^is dissolved in lOOmH Tris 
15 (pH 8.0)/5mM DTT. 2-Nitro-5-thiocyanobenzoate is then 
added and the reaction is allowed to proceed for 20 
minutes at room temperature. Raising the pH to 9.0 
ensures cleavage at the N-terminal side of the now 
modified cysteine residue* 

20 

8 . Protocol I 

Cleavage with Staphylococcus Aureaus V8 Protease 

lOpg of fusion protein is digested with l}xg of 
protease at 37 *C in 50mH ammoniumbicarbonate (pH 7.8) 
25 for up to 18 hours. 

9. Protocol J 

Renin Cleavage 
The fusion protein is dissolved in 50mM 
30 sodiumphosphate (pH 7.1), 5mM EDTA, 5mM 

sodiumtetrathionate . Renin is added at a ratio of 
1:1000 to the fusion protein and the mixture is 
incubated at 37 *C for 16 hours. 
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10. Protocol K 

Cleavage at Tryptophan Residues 
with BNPS-skatole 

The fusion protein is dissolved in 50% acetic 

5 acid at a concentration of 10 mg/ml. A freshly prepared 

solution of BNPS-skatole (2-(2-nitrophenylsulfenyl)-3- 

methyl-3-bromoindolenine) in acetic acid is added at a 

100-fold molar excess (versus tryptophan residues in the 

protein) . The mixture is incubated at room temperature 

10 for 48 hours in the dark. Tyrosine can be added as a 

scavenger. 

IV. H. Final Separation Protocol 

After the desired protein is cleaved from the 
hCAII moiety by a chemical or enzymatic method, 
separation of the constituents can be made by means 
known to those of skill in the art. For example, the 
chemical method produces upon cleavage two proteins, the 
hCAII with attached interconnecting peptide and the 
variable fused polypeptide. The hCAII plus 
interconnecting peptide can be removed by chromatography, 
on a sulfanilamide column. The variable fused 
polypeptide elutes from the column. 

For the enzymatic method, the cleavage mixture 
contains three proteins, hCAII plus the attached 
interconnecting peptide, the variable fused polypeptide, 
and the protease (cleaving enzyme) . hCAII can be 
removed by sulfanilamide immobilization as above. The 
protease can be separated from the variable fused 
polypeptide by use of the chemical and/or physical 
properties of the desired peptide and protease including 
size exclusion, ion exchange or affinity chromatography. 

V, Examples 

Example 1 
Synthesis of Gene and Vectors 
Five different plasmids were constructed, all 
of which coded for a human carbonic anhydrase II (CAII) 



wo 92/01707 



PCT/US91/04S11 



59 

C-terminal fusion protein. Each plasmid included the 
gene for the foregoing fusion protein under the 
transcriptional control of the T7 promoter. The fusion 
protein itself consisted of a truncated hCAII protein 
5 which lacked the C-terminal three amino acids ^ followed 
by the enterokinase recognition sequence 
(ValAspAspAspAspLys) , followed by the protein of 
interest. The five genes that were fused onto the CAII 
gene code for angiotensin (a ten amino acid (AA) 
10 protein)/ a calcitonin derivative (a 32 AA protein), 
bovine caltrin (a 47 AA protein), the delta subunit of 
the coli ATPase (a 174 AA protein) and the epsilon 
subunit of the coli ATPase (a 173 AA protein). The 
first three genes were encoded by synthetic genes, the 
15 last two were isolated from the previously cloned unc 
operon. The synthetic genes were designed using optimal 
codon usage for E^ coli and contain unique and useful 
restriction sites for future alterations. The 
expression system that may be used initially uses the T7 
20 promoter to direct transcription of the fusion gene by a 
chromosomal ly encoded T7 RNA Polymerase which is 
produced upon induction by isopropylthiogalactoside 
(IPTG). 

In an alternate system, the gene for T7 RNA 
25 Polymerase could be encoded by a second plasmid. This 
gene could under the transcriptional control of any 
promoter compatible with the host cell (e.g. a 
temperature inducible promoter, a tryptophan promoter 
etc . ) . 

30 

Construction of p0304; 

All clones were derived from the parent vector 
pET31FlmHCA2. The T7 expression cassette of pET-3c 
(described by Rosenberg, A.H., Lade, B.N., Chui, D., 
35 Lin, S.W., Dunn, J.J., and Studier, F.W. Gene 56, 125- 
135 (1987) was cloned into a truncated pSP64. This 
truncated pSP64 already contained a DNA segment encoding 
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the Fl origin. The gene coding for human carbonic 
anhydrase II (hCAII) was then transferred to this 
expression cassette and yielded pET31FlinHCA2. This 
vector was used as the basis for the construction of the 
5 fusion vector p0304. For this purpose, two 

complementary oligonucleotides with the following 
secpience were synthesized: 5'- 
AGCTTTCGTTGACGACGACGATATCTT-3 ' and 5'- 

AGCTAAGATATCGTCGTCGTCAACG-3 ' . Upon phosphorylation of 

10 each oligonucleotide (following the procedures of 

Sambrook et al-, cited supra ), they were pooled, heated 
to 80 'C and cooled slowly to room temperature to allow 
for annealing. This double-stranded oligonucleotide 
coded for the first five amino acids of the enterokinase 

15 recognition sequence. It also contained a unique EcoRV 
site and Hindlll cohesive termini. The sequence of the 
oligonucleotide was such that, upon insertion of this 
oligonucleotide into a Hindi I I site, only the 5 '-end 
Hindi 1 1 site was recovered. This double-stranded 

20 oligonucleotide was then ligated into the parent vector 
which had been digested with Hindlll and 
dephosphorylated with calf intestinal phosphatase (CIP) 
following the procedure of Sambrook et al., cited supra. 
The ligation was performed at 16 *C for IS hours. 

25 coli K-12 strain SB221 was made competent by the CaCla 
(as described in Sambrook et al-, cited supra ) and 
transformed with an aliquot of this ligation mixture. 
Plasmid DNA was isolated from single colonies by the 
alkaline lysis method (described in Sambrook et al., 

30 cited supra), and cuxalyzed with restriction enzymes. 

The orientation of the oligonucleotide could be verified 
using the Hindlll site. One such positive clone was 
denoted p0304 which now served as the parent vector for 
the construction of fusion genes. p0304 coded for hCAII 

35 fusion protein in which the C-terminal three amino acids, 
of hCAII were replaced by part of the enterokinase 
recognition sequence. The oligonucleotide introduced a 
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unique EcoRV site, which may be used as the cloning site 
for the construction of fusion genes. If the gene 
cloned into this EcoRV site starts with a Lysine codon, 
then the enterokinase recognition sequence is created. 

5 

Construction of Several Additional Fusion Genes: 

p0304AnQ . Two complementary oligonucleotides 
(S'-AAAGACCGTGTATACATCCATCCGTTCCATCTGTAAGAT-S' and 5'- 
ATCTTACAGATGGAACGGATGGATGTATACACGGTATTT-3 ' ) were 
10 synthesized. Phosphorylation and annealing as described 
above, resulted in a double-stranded oligonucleotide 
with blunt ends and an internal AccI site. This piece 
of DNA was then cloned into the unique EcoRV site of 
p0304. Clones were analyzed as above, and one positive 

15 clone was named p0304Ang. This plasmid contained a 
fusion gene that coded for a truncated hCAII protein 
(minus the last three amino acids), fused C-terminally 
to an enterokinase recognition sequence followed by the 
angiotensin peptide. 

20 p0304Cal . pINMetCalS is bacterial expression 

plasmid that contains a synthetic gene that codes for 
bovine caltrin (Van Heeke, G.; Ph.D., Dissertation . 1989 
University of Nebraska, Lincoln) . The gene starts with 
a methionine codon and has an EcoRV site immediately 

25 upstream of it. Digestion of this plasmid with EcoRV 
and BamHI released a fragment containing the entire 
caltrin coding sequence including the methionine codon 
and a 3' untranslated region of about 150 bp. This 
reaction mixture was treated with T4 DNA polymerase to 

30 flush the BamHI overhang and the caltrin containing 

fragment was cloned into the EcoRV site of p0304. This 
resulted in an in-frame alignment of the caltrin gene 
with the hCAII gene. Restriction enzyme analysis was 
used to determine the presence and orientation of the 

35 insert. One positive clone was named p0304Cal. It 

contained a fusion gene that coded for hCAII (minus the 
last three amino acids), linked to Met-caltrin via a 
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Val-(Asp)4 peptide. The enterokinase recognition 
sequence was not recovered, however, caiBr treatment can 
be used to cleave the caltrin portion of the fusion 
protein. 

5 pQ304delta . A DMA fragment containing the gene 

coding for the delta subunit of the Ej. coli Fj Fo-ATPase 
was isolated from pRPGSl (described by Gunsalus, R.P., 
Srusilow, W.S.A., and Simoni, R.D. Proc. Natl. Acad. 
Sci. USA 79, 320-324 (1982)). The delta gene was 

10 isolated from pRPGSl and subcloned into pTZlBR. A 

unique EcoRI site was introduced near the 5' end of the 
gene using site-specific mutagenesis (as described in 
Sambrook et al . , cited supra ) . A synthetic 
oligonucleotide that restored the 5' end sequence and 

15 added a lysine codon as part of a Dral site, was cloned 
into this EcoRI site. This resulted in plasmid pSlIS2.3- 
5, which was then digested with Dral and Hindlll, 
treated with T4 DNA polymerase, and the desired fragment 
was cloned into the EcoRV site of p0304. As above, 

20 restriction analysis was used to confirm the presence 
and orientation of the insert. A positive clone was 
denoted p0304delta. It coded for the aforementioned 
truncated hCAlI protein attached to the delta subunit 
via an enterokinase linker peptide (Val-(Asp)4Lys) . 

25 n0304eps . A similar route was followed for the 

epsilon subunit of the E^ coli FiFo-ATPase. The gene 
coding for the epsilon subunit with a lysine attached to 
its N-terminal end was isolated from pRPG53Eol. This 
plasmid itself was derived from pRPG5 3 (Gunsalus et al., 

30 cited supra ) . It had a lysine codon as part of a Dral 
restriction site immediately upstream of the start 
codon. Digestion with Dral released this fragment, 
which was then inserted into the EcoRV site of p0304. 
The new plasmid, named p0304eps, coded for the epsilon 

35 subunit linked to hCAII via an enterokinase recognition 
sequence. 
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D0304Calcit , A synthetic gene that coded for a 
calcitonin derivative was constructed using six separate 
oligonucleotides. The gene was designed using optimal 
codon usage for £^ coll and had restriction sites edited 
5 at regular spacing. The complete sequence is provided 
below. The synthetic gene was first assembled in 
pTZlSR. The coding sequence was designed such that a 
Oral site was present immediately upstream of the start 
signal and an EcoRV site downstream of the stop codon. 

10 An extra Ala codon was added at the 3' end, followed by 
a stop codon. The Dral site also introduced a lysine 
codon directly adjacent to the cysteine. The gene was 
then transferred as a Dral /EcoRV fragment to the EcoRV 
site of p0304. This created an in- frame alignment of 

15 the calcitonin gene with the hCAII gene. Upon 

expression, a fusion protein was produced that had the 
calcitonin-like peptide linked to hCAII via the 
enterokinase recognition sequence. 

Sequence for the synthetic Calcitonin 

20 derivative: 

5 ' -AAGCTTTAAATGCGGTAACCTGTCGACTTGCATGCTGGGTACCTAC 
ACTCAAGATCTGAACAAATTTCJ^TACTTTCCCGCAGACATCGATAGGTGTO^ 
TCCGGCTTAAGATATCGGATCC-3 ' . 

All examples below describe experiments 

25 involving the angiotensin protein purification 
construct. 



Example 2 

Transformation of coll 
30 with the Recombinant vector 

E. coli was transformed according to standard 

procedures as described in Sambrook et al . , cited supra 

(also see protocols above). ^ coli K-12 SB221 was 

transformed with an aliquot of the ligation mix by the 

35 calcium chloride method described above. 
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Example 3 
Selection of recombinant cells 
Transformed cells were plated on Luria broth 
medium containing 100 ug/ml ampicillin and incubated at 
5 37 *C for 16-18 hours. 

Example 4 

Selection of Desired Recombinant Vector 
Single colonies were grown in 3-5 ml liquid 

10 Luria broth containing 50 ug/ml ampicillin, and plasmid 
DNA was isolated using the minprep alkaline lysis method 
(Sambrook et al., cited supra). Plasmid DNA was 
analyzed with restriction enzyme AccI which indicated 
the presence of the oligonucleotide coding for 

15 angiotensin. A second restriction enzyme analysis with 
PstI and EcoRV revealed the orientation of the inserted 
oligonucleotide. The junction region of a plasmid 
construct with the oligonucleotide in the correct 
orientation was then further verified by DNA sequencing 

20 and a positive clone was named p304Ang. This plasmid 

was then used to transform E^ coli BL21(DE3)pLysS (as in 
Example 3). Transformed cells were recovered on Luria 
broth plates containing both ampicillin (100 ug/ml) and 
chloramphenicol (10 ug/ml). These cells were then used 

25 for the overproduction of protein purification 
constructs . 

Example 5 

Culturing and production of 
30 protein purification construct 

Transformed cells (BL21(DE3)pLysS containing 

p0304Ang were grown in 3-5 ml Luria broth containing 

ampicillin and chloramphenicol at 37 'c for 16-18 hours. 

The culture was then diluted 1 to 100 in fresh medium 

35 and grown until the optical density reached 0.8-1.0. At 

this point 1^ IPTG (see above) was added to a final 

concentration of 0.05-0.4 mM or lactose was added to a 
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final concentration of 0.2 to 2 percent. The culture 
was further incubated at 21-37 *C for 2-16 hours. 

Example 6 

5 Lvsino mature culture of host cells 

The cells were harvested by centrifugation and 
washed in ice-cold 50 mM Tris pH 8.0. The cell pellet 
was then frozen at -20 *C. Upon thawing of this pellet 
and resuspension in 50 mM Tris pH 8.0, the cells lysed 

10 on their own. The viscous cell extract was then treated 
with DNase (0.5 ug/ml) and RNase (1 ug/ml) in the 
presence of 10 mM MgCl2 at o'c for 45-60 minutes. The 
cell extract was spun to remove cellular debris and the 
recovered supernatant fraction was subjected to affinity 

15 chromatography. Alternatively, the viscosity can be 

reduced by repeated passage of the cell extract through 
a syringe needle. 



Example 7 

20 Alternate cell lysis, affinity 

chromatography and enzyme cleavage. 

The protein purification construct product can 

be isolated by column or batch-wise affinity 

chromatography. coli cells containing the protein 

25 purification construct incorporating angiotensin were 
freeze-fractured or freeze-thawed to release catalytic 
amounts of lysozyme expressed by the plasmid vector. 
The lysozyme lysed the remainder of the cells. This 
process was carried out in 0.05 M-Tris -sulfate 7.5» It 

30 was centrifuged to 20,000 X g to remove any remaining 
particulate matter. 

The dialyzed material was then added to a 10 mL 
column of affinity resin prepared with sulfanilamide as 
described above and previously equilibrated to 0.05 M- 

35 Tris-sulfate pH 7.5. The column was eluted with the 
same buffer. The majority of the protein in the cell 
extract did not adhere to the column and came through in 
the void volume. The column was washed and until all 
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280 run absorbing material had eluted from the column. 
It was then washed with 0.05 M-Tris-sulfate/ 1 M-sodiiun 
sulfate pH 7.5 to remove any protein material which had 
adhered to the column by adventitious bonding. The 
5 column was then eluted with 0.2 M-potassium thiosulfate 
0.05 M-Tris-sulfate pH 6.5 to remove the protein 
purification construct, i.e., angiotensin. The p2X)tein 
purification construct was collected and dialyzed to 
remove the 0-2 M potassium thiosulfate against the 
10 desired buffer. This material was then subjected to an 
analysis by electrophoresis procedures which confirmed 
its identity and subsequently reacted with enterokinase 
to cleave the Asp^Lys bond to liberate the desired, 
cloned angiotensin. 

15 

Example 8 

Partition chromatography of 
protein purification construct 
peptide and cleavage enzyme 

20 Size exclusion chromatography was used to 

separate the cleavage enzyme from the desired peptide. 

Alternatively, an affinity substrate for either the 

peptide or the cleavage enzyme can be used. 

25 Example 9 

Construction of a Multi-unit Gene 
for Cholecv5tokinin*8 

This is an example of the construction of a 

multi-unit gene for the peptide cholecystokinin-8 . 

30 Two sets of oligonucleotides are synthesized. 

An oligonucleotide with the following sequence, 5'^ 
AAAGATTATATGGGTTGGATGGATTTTAAATAAGAT-3 ' , and an 
oligonucleotide with the complementary sequence 
constitute the first set. This DNA fragment codes for 

35 the cholecystokinin-8 peptide with an added C-terminal 
lysine. Downstream of the lysine codon (AAA) is a stop 
codon (TAA) followed by the first half of an EcoRV 
restriction enzyme site (GAT). The additional C- 
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terminal lysine constitutes the intraconnecting peptide. 
Upon phosphorylation and annealing of these oligos (see 
above examples), this double-stranded DN& fragment is 
ligated into the single EcoRV site of the fusion 
5 e35>ression vector p0304. Insertion of this fragment can 
be monitored by the presence of the internal Dral 
(TTTAAA) restriction enzyme site. Correctness of 
orientation can be verified by restriction enzyme 
analysis, since an EcoRV site will be reconstructed at 
10 the 3' end of the gene. At this point, the variable 
fused peptide consists of a single copy with an extra 
lysine at the C-terminal end. A third oligo with the 
sequence 5 ' -AAGGATTATAT666TT6GAT66ATTTT-3 ' and its 
complementary oligo are then^ phosphorylated, annealed 
15 and cloned into the Dral restriction enzyme site of the 
previously cloned DNA fragment. Only one Dral site will 
be restored upon insertion, since the codon for lysine, 
now at the 5' end, was changed to AAG (instead of AAA). 
The restored Dral site is therefore useful for 
20 determining the orientation of the inserted oligo. At 
this point, the variable fused peptide consists of two 
copies. These steps can be repeated until the desired 
copy number is obtained. 7he multicopy gene construct 
then codes for a large polypeptide which consists of a 
25 string of cholecystokinin-S peptides each linked 

together by a lysine residue. The large polypeptide can 
be released from the protein purification construct by 
enzymatic cleavage at the enterokinase recognition 
sequence. The individual cholecystokinin-8-lysine 
30 fusion product peptides can then be obtained upon 

trypsin cleavage. The cholecystokinin-8 peptide is then 
recovered from the lysine fusion product peptide through 
the action of carboxypeptidase. Free lysine can be 
removed by ion exchange chromatography on Dowex-50. 
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CLAIMS: 

1. A protein purification construct comprising; a 
binding protein or a modified functional version 
thereof and a variable fused polypeptide which are 
tandemly coupled together by an interconnecting 
peptide, the binding protein being an enzyme-like 
protein, wherein 

a) the vauriable fused polypeptide is a single 
peptide unit, or a group of the same or 
different peptide units tandemly linked 
together by an intracoimecting peptide, and the 
peptide unit is a sJLngle product peptide, or a 
group of the same -or different product peptides 
tandemly linked together by an innerconnecting 
peptide; 

b) the amino acid sequences of the inter, intra 
and innerconnecting peptides are not available 
for cleavage in the binding protein or product 
peptides ; and 

c) the binding protein exhibits highly specific 
binding to a low molecular weight ligand, has 
at least a single ligand binding site and a 
ligand solution dissociation constant of no 
greater than about 10"'m. 

2. A protein purification construct according to claim 
I wherein the binding protein is carbonic anhydrase, 
or a modified functional version thereof. 

3. A protein pxirif ication construct according to claim 
1 wherein the interconnecting peptide, the 
intraconnecting peptide, and the innerconnecting 
peptide are small peptides of up to about 10 amino 
acid units. 
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4. A protein purification construct according to claim 
1 wherein the interconnecting peptide, 
intraconnecting peptide, and innerconnecting peptide 
are individually and distinguishably selected from a 
peptide consisting of (Asp)4Lys, IleGluGlyArg, 
ArgGlyProArg, ArgGlyGly, AsnGly, AspPro, Met, 
AspGly, HisProPheHisLeuLeuValTyr, Lys, Arg, Phe, 
Tyr, Trp, Arg, and Glu. 

5. A protein purification construct according to claim 
1 wherein the variable fused polypeptide is a single 
product peptide or multiple units of said single 
product peptide tandemly liziked together by 
intraconnecting peptides; or the variable fused 
polypeptide is a single peptide unit of two to four 
different product peptides, the product peptides 
being tandemly linked together by innerconnecting 
peptides; or a group of said peptide units, the 
units being linked together by intraconnecting 
peptides • 

6. A recombinant gene comprising: a DNA sequence for a 
protein purification construct of a binding protein 
or modified functional version thereof and a 
variable fused polypeptide which are tandemly 
coupled together by an interconnecting peptide, the 
binding protein being an enzyme-like protein, 
wherein 

a) the variable fused polypeptide is a single 
peptide unit or a group of the same or 
different peptide units tandemly linked 
together by an intraconnecting peptide and the 
peptide unit is a single product peptide or a 
group of the same or different product peptides 
tandemly linked together by an innerconnecting 
peptide; 
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the amino acid sequences of the inter, intra 
and innerconnecting peptides are not available" : 
for cleavage in the binding protein or product 
peptides; and 

the binding protein exhibits highly specific 
binding to a low molecular weight ligand, has 
at least a single binding site and a ligand 
solution dissociation constant of no greater 
than about lO'^M. 

7. A recombinant gene according to claim 6 wherein the 
binding protein is carbonic anhydrase, or a modified 
functional version thereof. 

8. An expression vector comprising: a base vector with 
an inserted recombinant gene coding for a binding 
protein or modified functional version thereof and a 
variable fused polypeptide which are tandemly 
coupled together by an interconnecting peptide, the 
binding protein being an enzyme-like protein, 
wherein 

a) the variable fused polypeptide is a single 
peptide unit or a group of the same or 
different peptide units tandemly linked 
together by an intraconnecting peptide and the 
peptide unit is a single product peptide or a 
group of the same or different product peptides 
tandemly linked together by innerconnecting 
peptides ; and 

b) the binding protein exhibits highly specific 
binding to a small molecular weight ligand, has 
at least a single ligand binding site and a 
ligand solution dissociation constant of no 
greater than about 10"^11. 



b) 



c) 
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9. An expression vector according to claim 8 wherein 
the binding protein is carbonic anhydrase, or a 
modified functional version thereof. 

10. An expression vector according to claim 8 which is 
compatible with a host cell and contains a promoter, 
cm operator, and a ribosome binding site. 

11. An expression vector according to claim 8 which 
contains regulating regions for post-expression, 
intracellular manipulation of an expressed protein 
purification construct. 

12. A transformed cell comprising: a recombinant 
eukaryotic or prokaryotic cell containing an 
expression vector for a protein purification 
construct of a binding protein or a modified 
functional version thereof and a variable fused 
polypeptide which are tandemly coupled together by a 
interconnecting peptide, the binding protein being 
an enzyme-like protein, wherein 

the variable fused polypeptide is a single 
peptide unit or a group of the same or different 
peptide units tandemly linked together by an 
intraconnecting peptide and the peptide unit is a 
single product peptide or a group of the same or 
different product peptides tandemly linked together 
by interconnecting peptides. 

13. A method for expressing a protein purification 
construct of a binding protein or modified 
functional version thereof and a variable fused 
polypeptide which are tandemly coupled together by 
an interconnecting peptide and wherein the binding 
protein is an enzyme-like protein, comprising: 

inserting an expression vector carrying a 
recombinant gene for the protein purification 
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construct into a compatible host cell to produce a 
transformed host cell; 

manipulating a culture of transformed host 
cells to express the protein purification construct; 
and 

the variable fused polypeptide being a single 
peptide unit or a group of the same or different 
peptide units tandemly linked together by an 
intraconnecting peptide and the peptide unit is a 
single product peptide or a group of the same or 
different product peptides tandemly linked together 
by innerconnecting peptides. 

14. A method according to claim 13 comprising the 
additional step of selecting the appropriate 
transformed host cell after insertion. 

15. A method according to claim 13 wherein the binding 
protein is carbonic anhydrase/ or a modified 
functional version thereof. 

16. A method according to claim 13 wherein the 
manipulation step is accomplished by control of 
expression. 

17. A method for purifying a protein purification 
construct of a binding protein or modified 
functional version thereof and a variable fused 
polypeptide which are tandemly coupled together by 
an interconnecting peptide and wherein the binding 
protein is cm enzyme-like protein, comprising: 

contacting a mixture containing the protein 
purification construct with an immobilized ligand 
for the binding protein to bind the protein 
purification construct onto the immobilized 
inhibitor; 
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washing the bound protein purification 
construct to remove unbound substances; and 

the variable fused polypeptide being a single 
peptide unit or a group of the same or different 
peptide units tandemly linked together by an 
intraconnecting peptide and the peptide unit is a 
single product peptide or a group of the same or 
different product peptides tandemly linked together 
by innerconnecting peptides. 

18. A method according to claim 17 further comprising: 
treating the washed protein purification construct 
in bound or unbound condition with a cleavage 
reagent for the interconnecting peptide to produce 
free variable fused polypeptide. 

19. A method according to claim 17 wherein the binding 
protein exhibits highly specific binding to a low 
molecular weight ligand, has at least a single 
ligand binding site and a ligand solution 
dissociation constant of no greater than about 10"^M. 

20. A method according to claim 17 wherein the variable 
fused polypeptide is a single peptide unit of two to 
four different product peptides, the product 
peptides being tandemly linked together by 
innerconnecting peptides; or a group of said peptide 
units, the units being tandemly linked together by 
intraconnecting peptides, and the method further 
comprises the steps of: 

cleaving the intraconnecting peptides with a 
second enzyme or chemical to produce free peptide 
units , 

cleaving the innerconnecting peptides of the 
peptide units with a third enzyme or chemical to 
produce different free product peptides fused to 
peptide residue, 
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separating the different product peptides fused 
to peptides residues by employing the dif ferent, 
chemical, physical or antigenic properties of the 
product peptides, and 

cleaving the peptide residues with an 
exopeptidase to produce the different product 
peptides in separated form. 

21. A method for producing a protein purification 
construct of a binding protein or modified 
fxinctional version thereof and a variable fused 
polypeptide which are tandemly coupled together by 
an intercoimecting peptide and wherein the binding 
protein is an enzyme-like protein, comprising: 

inserting an expression vector carrying a 
recombinant gene for the protein purification 
construct into a compatible host cell to produce a 
transformed host cell; 

manipulating a culture of transformed host 
cells to express the protein purification construct 
in a mixture with culture medium; 

contacting the culture medium mixture 
containing the protein purification construct with 
an immobilized ligand for the binding protein to 
bind the protein purification construct onto the 
immobilized inhibitor; 

washing the boxind protein purification 
construct to remove unbound substances; and 

the variable fused polypeptide being a single 
peptide unit or a group of the same or different 
peptide units tandemly linked together by an 
intraconnecting peptide and the peptide unit is a 
single product peptide or a group of the same or 
different product peptides tandemly linked together 
by innerconnecting peptides. 
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FICURE 3 
Carbonic Anhydrase Secpienee 



MetSerHi sHi sTrpGl yTypGI yLysHI sAsn 

•ATGTCCCATCACTGG5SGTACGGCAAACACAAC 

_ 1 20 

Gl yProGl uHi sTrpHi sLysAsoPheProI 1 eAl aLysGl yGl uArgGl nSerProVal Asp 1 1 eAspThrHi s 
GGACCTGAGCACTGGCATAAGGACTTCCCCATTGCCAAGGGAGAGCGCCAGTCCCCTGTTGACATCGACACTCAT 

40 60 80 100 

ThrAl aLysTyrAspProSerLeuLysProLeuSerVal SerTyrAspGl nAl aThrSerLeuArgll eLeuAsn 
ACAGCCAAGTATGACCCTTCCCT6AAGCCCCTGTCTGTTTCCTA TGATCA AGCAACTTCCCTGA G6ATCCT CAAC 

120 140 Sau3AI BaroHi 

AsnGlyHisAI aPheAsnValGluPhsAspAspSer61nAsplysAUValLeuLysG1yG1yProLeuAspGly 
AATGGTCATGCTTTCAACGTGGAGTTTGATGACTCrCAGGftCAAAGCAGTGCTCAAGGGAGGACCCCTGGATQGC 

200 220 240 

ThrTyrArgLeuneGlnPheHisPheHisTrpGlySerLeuAspGlyGlnGlySerGluHisThrValAspLys 
ACmCAGAnGAmAGTTTCACmCACTGGGGmACTTGATSQACAAGGTTCAGAGCATACTGTGGATAAA 

280 300 320 

LysLysTyrAl aAl aGl uLeuHi sLeuVal Hi sTrpAsnThrLysTyrGI yAspPheGl yLysAI aVal Gl nGl n 
AAGAAATAT GCTGCAGA ACTTCACTTGGTTCACTGGAACACCAAATATGGGGATTTTGGGAAAGCTGTGCAG^^ 

340 PstI 360 380 400 

ProAspGlyLeuAl aVal LeuGlyll ePheLeuLysVal GlySerAl aLysProGl yLeuGl nLysVal Val Asp 
CCTGATGGACTGGCCGTTCTAGGTATTTTTTTGAAGGTTGGCAGCGCTAAACCGGGCCTTCAGAAAGmTTGAT . 

420 440 450 480 

Val LeuAspSerll eLysThrLysGlyLysSerAl aAspPheThrAsnPheAspProArgGlyLeuLeuPpoQl u 
GTGCTSGAmCATTAAAACAAAGGGCAAGAGTGCTGACnCACTAACTTCGATCCTCGTGGCCTCC^^ 

500 520 Sau3AI 

SerLeuAsoTyrTrpThrTyrProGlySarLeuThrThrProProLeuLeuGl uCysVal ThrTrpIl eVal Leu 
TCCTTGGAmCTGGACCTACCCAGGCTCACTGACCACCCCTCCrcnTCTGGAATGTGTGACCTGGATr^ 

580 600 620 

LysGluProIleSerValSerSerGluGlnValLeuLysPheArgLysLeuAsnPheAsnGlyGluGlyGluPro 
AAGGAACCCATCAGCGTCAGCAGCGAGCAGGTGTTGAAATTCCGTAAACTTAACTTCAATGGGGAGGG7GAACCC 

640 650 680 700 

Gl uGluLeuMetVal AspAsnTrpArgProAl aGl "P^LeuLysAsnArgGlnn eLysAT^^ 
GAAGAACTGATGGTGGACAACTGGCGCCCAGCTCAGCCAC7GAAGAACAGGCAAA i CAAASCi^CwTTCAAA 
itn 740 760 Hindlii 780 



Nucleotide seouencs and deduced amino acid sequenc of 
hunan carbonic anhydrase II cDNA. The sequence is 
taken froai Petersen and Lund, Acta Che mica Scand., 342 , 
319-323 (1988). 
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